Gene BBta_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4202 
Symbol 
ID5148773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4412733 
End bp4414043 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content68% 
IMG OID640559028 
Productputative N-acetylmuramoyl-L-alanine amidase 
Protein accessionYP_001240165 
Protein GI148255580 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.415108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.44949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTCCGG CGCAGGCTCA GGGCGTGCCG GCGGCGCAGA CTCCCGCGGT AAGCGTCACG 
CCAGGGTTTC CGATTGCGTC CGACGCGCGT CTGGCGGGCG ATTCTCGACA GACGCGATTC
GTGCTCGACC TCGACAAGGC GGTTCCGTTC CGCGCCTTTC TGCTCAGCGA GCCCTATCGC
GTCGTGGTCG ACCTGCCGCA GGTTAGCTTC CATCTTCCTG CGGGGGCCGG AACCAGCGGC
CGTGGCCTGG TGAAGGCATT CAGGTACGGC ATGGTCATGC CCGGCGGCTC CCGGATCGTG
TTCGATCTGA CGGGGCCGGC GCGAATCGCC AACTCTTCGG TGCTGGATGC CGCCAACGGG
CAGCCGCCCC GGCTCGTGCT GGAACTCGAG GAGGTCGACG AAGCGACCTT CGCGCGGGCG
ATCGGCACCG ACAACCGTCC GACGCTCAAG CCCAGCTTGG CCGATTCGAG CCCTTCGGCG
GCGCTGGCGG CGGCGTCTCC CAATCCGTTG GCGGTGCGCC CCCCGCCGAC CACGCCTGCA
GGAGCTCCGG CGGCGCCAAA TGCGCCCGCG GACACGCGGC CGGTGGTGGT GATCGATCCG
GGACATGGCG GTATCGACTA TGGAACCCAG GGCGGCGAGA TCCCGGAAAA GAATCTGGTG
CTGGCGTTCG GACAGGCGCT GCGCGATCGC CTCGAGCGCG CGGGCAAGTT CCGCGTCGTC
ATGACCAGGG ATGATGACAC CTTCATTCCG CTCGGCGACC GGGTCCGCAT CGCCCAGAAG
CAGGGCGCGG CCCTGTTCGT GTCGATCCAT GCCGACGCGT TACCGCGCGG CGAGGGCGAT
GCGCAGGGCG CGACGATCTA CACGCTGTCG GACAAGGCCT CCGACGTCGA AGCGGAGCGG
CTCGCCGAGG CCGAGAACCG CGCCGACCTG ATCGGTGGCG TCAACCTGAC CGAGGAGCCA
GCCGACGTGG CGGATATTCT GATCGATCTG GCCCGGCGCG AAACCCGCAC CTTTTCCAAC
AGGTTCGCCC GCGTCTTGAT GAACGACATG AAGACGGCGG TGCATATGCA TAAGCGGCCG
TTGAAATCGG CCGGGTTCCG CGTGCTGAAG GCGCCCGACG TGCCCTCGGT GCTGGTCGAG
CTCGGCTATG TCTCGAACAA GGGCGACATG GGGAACCTGG TGTCGGAAGC CTGGCGGTCG
CGCACGGCCG ACGCGATGGC CCGCGCGATC GATGTCTTCC TGACCAAACG CGTTGCAAAT
GCTGGTGCCG AGCGGCCGGA GAAGCCGGCG CCCGCCCGTG CAAAGCCCTA G
 
Protein sequence
MLPAQAQGVP AAQTPAVSVT PGFPIASDAR LAGDSRQTRF VLDLDKAVPF RAFLLSEPYR 
VVVDLPQVSF HLPAGAGTSG RGLVKAFRYG MVMPGGSRIV FDLTGPARIA NSSVLDAANG
QPPRLVLELE EVDEATFARA IGTDNRPTLK PSLADSSPSA ALAAASPNPL AVRPPPTTPA
GAPAAPNAPA DTRPVVVIDP GHGGIDYGTQ GGEIPEKNLV LAFGQALRDR LERAGKFRVV
MTRDDDTFIP LGDRVRIAQK QGAALFVSIH ADALPRGEGD AQGATIYTLS DKASDVEAER
LAEAENRADL IGGVNLTEEP ADVADILIDL ARRETRTFSN RFARVLMNDM KTAVHMHKRP
LKSAGFRVLK APDVPSVLVE LGYVSNKGDM GNLVSEAWRS RTADAMARAI DVFLTKRVAN
AGAERPEKPA PARAKP