Gene Information |
Locus tag | EcSMS35_2407 |
Symbol | arnB |
ID | 6143266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2455289 |
End bp | 2456446 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617280 |
Product | UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase |
Protein accession | YP_001744452 |
Protein GI | 170683278 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0399] Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis |
TIGRFAM ID | |
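The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length_bp = gene_length(2455289, 2456446)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.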
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGGAAG GAAAAGCAAT GTCAGAATTT TTGCCTTTTT CGCGACCAGC AATGGGCGTG GAGGAACTCG CTGCAGTTAA AGAGGTTCTC GAATCCGGTT GGATCACAAC CGGTCCGAAG AATCAGGCGC TTGAACAAGC CTTTTGCCAG TTGACGGGAA ATCAGCATGC CATCGCGGTC AGTTCAGCCA CCGCCGGAAT GCATATCACG CTAATGGCGT TGGAAATAGG CAAGGGCGAT GAAGTGATTA CGCCTTCCCT GACCTGGGTT TCAACCCTCA ATATGATCTC CTTGTTGGGT GCAACGCCGG TAATGGTGGA TGTCGACCGC GATACGCTGA TGGTCACGCC TGAAGCGATC GAGTCAGCCA TTACGCCACG CACTAAAGCC ATCATTCCGG TGCATTATGC CGGTGCGCCA GCAGATATTG ACGCCATTCG CGCCATTGGC GAACGTTACG GCATCGCAGT TATCGAAGAT GCTGCCCATG CCGTCGGTAC GTATTACAAA GGGCGACATA TTGGCGCAAA AGGTACCGCT ATTTTTTCAT TTCATGCCAT TAAAAATATT ACCTGTGCTG AAGGTGGCCT GATTGTAACT GATAATGAAA ACCTTGCCCG CCAGCTACGG ATGCTGAAAT TTCACGGTCT GGGTGTCGAT GCCTATGACA GACAAACCTG GGGCCGTGCA CCGCAAGCTG AAGTCTTAAC ACCGGGCTAT AAGTACAATC TGACCGATAT TAACGCCGCG ATTGCCCTGA CACAGTTAGC AAAATTAGAG CACCTCAACA CCCGTCGGCG CGAAATTGCC CAGCAATATC AGCAAGCACT GGCAGCTCTC CCCTTTCAGC CATTAAGCCT TCCCGCCTGG CCGCACGTTC ACGCCTGGCA TCTGTTTATT ATTCGTGTCG ATGAACAACG TTGTGGTATC AGTCGCGATG CGTTGATGGA AGCGTTAAAA GAAAGAGGTA TTGGTACCGG GTTACATTTC CGCGCCGCTC ACACACAAAA ATATTATCGC GAGCGTTTTC CCACGCTGTC GTTACCGAAT ACCGAATGGA ATAGCGAACG CATCTGTTCG TTGCCGCTGT TCCCGGATAT GACTACCGCC GATGCCGACC GCGTCATCAC AGCCCTTCAG CAACTCGCAG GACAATAA
Protein sequence | MAEGKAMSEF LPFSRPAMGV EELAAVKEVL ESGWITTGPK NQALEQAFCQ LTGNQHAIAV SSATAGMHIT LMALEIGKGD EVITPSLTWV STLNMISLLG ATPVMVDVDR DTLMVTPEAI ESAITPRTKA IIPVHYAGAP ADIDAIRAIG ERYGIAVIED AAHAVGTYYK GRHIGAKGTA IFSFHAIKNI TCAEGGLIVT DNENLARQLR MLKFHGLGVD AYDRQTWGRA PQAEVLTPGY KYNLTDINAA IALTQLAKLE HLNTRRREIA QQYQQALAAL PFQPLSLPAW PHVHAWHLFI IRVDEQRCGI SRDALMEALK ERGIGTGLHF RAAHTQKYYR ERFPTLSLPN TEWNSERICS LPLFPDMTTA DADRVITALQ QLAGQ
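The sequences above are printed in space-separated blocks of 10 characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the grouping and compute a GC fraction. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs only on the first two blocks and the final block of the gene sequence, so its GC value describes that 20-base fragment and not the 52% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungrouped DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks and the last block of the gene sequence in this record.
first_blocks = "ATGGCGGAAG GAAAAGCAAT"
last_block = "GACAATAA"

fragment = clean_sequence(first_blocks)
print(len(fragment), round(gc_fraction(fragment), 2))

# A complete CDS is expected to start with a start codon and end with a
# stop codon (TAA here, one of the stops in translation table 11).
print(fragment.startswith("ATG"), last_block.endswith("TAA"))
```

Passing the full gene sequence string through `clean_sequence` and `gc_fraction` would allow a comparison against the GC content field.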