Gene EcSMS35_2407 details

Gene Information

Locus tag: EcSMS35_2407
Symbol: arnB
ID: 6143266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2455289
End bp: 2456446
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 52%
IMG OID: 641617280
Product: UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase
Protein accession: YP_001744452
Protein GI: 170683278
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis
TIGRFAM ID:
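
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2455289
end_bp = 2456446

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1158 385
```

Both results agree with the Gene Length and Protein Length fields of the record.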


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGGAAG GAAAAGCAAT GTCAGAATTT TTGCCTTTTT CGCGACCAGC AATGGGCGTG 
GAGGAACTCG CTGCAGTTAA AGAGGTTCTC GAATCCGGTT GGATCACAAC CGGTCCGAAG
AATCAGGCGC TTGAACAAGC CTTTTGCCAG TTGACGGGAA ATCAGCATGC CATCGCGGTC
AGTTCAGCCA CCGCCGGAAT GCATATCACG CTAATGGCGT TGGAAATAGG CAAGGGCGAT
GAAGTGATTA CGCCTTCCCT GACCTGGGTT TCAACCCTCA ATATGATCTC CTTGTTGGGT
GCAACGCCGG TAATGGTGGA TGTCGACCGC GATACGCTGA TGGTCACGCC TGAAGCGATC
GAGTCAGCCA TTACGCCACG CACTAAAGCC ATCATTCCGG TGCATTATGC CGGTGCGCCA
GCAGATATTG ACGCCATTCG CGCCATTGGC GAACGTTACG GCATCGCAGT TATCGAAGAT
GCTGCCCATG CCGTCGGTAC GTATTACAAA GGGCGACATA TTGGCGCAAA AGGTACCGCT
ATTTTTTCAT TTCATGCCAT TAAAAATATT ACCTGTGCTG AAGGTGGCCT GATTGTAACT
GATAATGAAA ACCTTGCCCG CCAGCTACGG ATGCTGAAAT TTCACGGTCT GGGTGTCGAT
GCCTATGACA GACAAACCTG GGGCCGTGCA CCGCAAGCTG AAGTCTTAAC ACCGGGCTAT
AAGTACAATC TGACCGATAT TAACGCCGCG ATTGCCCTGA CACAGTTAGC AAAATTAGAG
CACCTCAACA CCCGTCGGCG CGAAATTGCC CAGCAATATC AGCAAGCACT GGCAGCTCTC
CCCTTTCAGC CATTAAGCCT TCCCGCCTGG CCGCACGTTC ACGCCTGGCA TCTGTTTATT
ATTCGTGTCG ATGAACAACG TTGTGGTATC AGTCGCGATG CGTTGATGGA AGCGTTAAAA
GAAAGAGGTA TTGGTACCGG GTTACATTTC CGCGCCGCTC ACACACAAAA ATATTATCGC
GAGCGTTTTC CCACGCTGTC GTTACCGAAT ACCGAATGGA ATAGCGAACG CATCTGTTCG
TTGCCGCTGT TCCCGGATAT GACTACCGCC GATGCCGACC GCGTCATCAC AGCCCTTCAG
CAACTCGCAG GACAATAA
 
Protein sequence
MAEGKAMSEF LPFSRPAMGV EELAAVKEVL ESGWITTGPK NQALEQAFCQ LTGNQHAIAV 
SSATAGMHIT LMALEIGKGD EVITPSLTWV STLNMISLLG ATPVMVDVDR DTLMVTPEAI
ESAITPRTKA IIPVHYAGAP ADIDAIRAIG ERYGIAVIED AAHAVGTYYK GRHIGAKGTA
IFSFHAIKNI TCAEGGLIVT DNENLARQLR MLKFHGLGVD AYDRQTWGRA PQAEVLTPGY
KYNLTDINAA IALTQLAKLE HLNTRRREIA QQYQQALAAL PFQPLSLPAW PHVHAWHLFI
IRVDEQRCGI SRDALMEALK ERGIGTGLHF RAAHTQKYYR ERFPTLSLPN TEWNSERICS
LPLFPDMTTA DADRVITALQ QLAGQ
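
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The sketch below is one way to do this, assuming the standard codon assignments, which translation table 11 shares. The alternative start codons of table 11 are not modelled, and the helper names `clean` and `translate` are illustrative.

```python
# Standard codon assignments (shared by translation table 11), built in
# T, C, A, G order. Alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGGCGGAAG GAAAAGCAAT GTCAGAATTT TTGCCTTTTT CGCGACCAGC AATGGGCGTG"
last_row = "CAACTCGCAG GACAATAA"

print(translate(first_row))  # MAEGKAMSEFLPFSRPAMGV
print(translate(last_row))   # QLAGQ
```

The two outputs match the first 20 and the last 5 residues of the protein sequence. Each row of the gene sequence holds 60 bases, a multiple of three, so every row starts in frame and can be translated on its own.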