Gene Information |
Locus tag | Ent638_2204 |
Symbol | |
ID | 5112896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 2394351 |
End bp | 2395913 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640492391 |
Product | anthranilate synthase component I |
Protein accession | YP_001176930 |
Protein GI | 146311856 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
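
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


if __name__ == "__main__":
    length = gene_length_bp(2394351, 2395913)  # Start bp and End bp from the record
    print(length, protein_length_aa(length))
```

With the values listed for Ent638_2204, the two results can be compared against the Gene Length and Protein Length rows.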
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00459304 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACCG CCAAAACAAA ACTTGAGTTG CTGACCTGCG AAGCAATCTA TCGCCACAAC CCAACCGCCT TGTTTCAGCA AGTCTGCGGT GCACGCCCCG CCACGCTGCT GCTGGAATCC GCCGATATCG ATAGCAAAGA CGATCTCAAA AGCCTGCTGC TGGTAGACAG CGCATTGCGG ATCACGGCCT TAGGTGACAC TGTCACCATC CAGGCGTTAT CAGAAAACGG TGCGTCGCTG CTGCCGCTGC TGGATGCCGC TCTGCCCTCT GGCATCGAAA ATGAAAAACG TCCGCAAGGC CGCACACTGC ATTTCCCAGC GGTAAGCCAA CTGCTGGATG AAGACGCGCG TCTGTGTTCG CTGTCCGTTT TTGATGCCTT CCGCTTACTG CAAAATCTGG TGGATGTACC TGAGGACGAG CGCGAAGCGA TGTTCTTTGG CGGGCTGTTT GCCTATGATC TGGTTGCCGG ATTCGAAAAC TTACCCGAAA CCGAGCAAGG CAACCGCTGC CCGGATTACT GCTTCTATCT GGCAGAAACC CTGATGGTGA TTGACCATCA GAAGAAATAC ACCCGCATTC AGGCCAGCCT CTTTACGCCT TCCGCTGCTG AAAAACAGCG CCTTGCACAG CGTATCGAAC AGCTGCAACA GCAGATGACG GAAGAACCGA CTGCGCTACC GGTGCAAAGC ATCGAGCATA TGCAGTGTGA AGTGAGCCAG ACGGACGATC AGTACGGCGC GGTTGTCCGC CAGATGCAAA AAGAAATTCG CGCAGGCGAG ATTTTCCAGG TGGTGCCGTC GCGTCGCTTC TCACTCCCGT GCCCTTCTCC GCTGGCAGCG TATGACGTGC TGAAGAAAAG CAATCCGAGC CCGTACATGT TCTTTATGCA GGACAACGAG TTCACGCTGT TTGGCGCATC GCCTGAAAGC TCACTGAAAT TCGATGCGAC CAGTCGTCAG ATTGAGATCT ACCCGATCGC CGGGACGCGT CCACGCGGTC GTCGCGCGGA TGGTTCACTG GACCGCGATC TTGACAGCCG CATCGAGTTA GAAATGCGTA CCGACCACAA AGAGCTCTCC GAGCACCTGA TGCTGGTTGA CCTGGCGCGT AACGATCTGG CGCGCATCTG TACGCCGGGC ACCCGTTACG TCGCAGATTT AACCAAAGTT GACCGCTACT CCTTCGTGAT GCACCTCGTT TCACGCGTTG TGGGCGAGCT ACGCCACGAT CTCGATGCGC TGCACGCCTA CCGCGCCTGC ATGAATATGG GCACCCTGAG CGGCGCGCCA AAAGTGCGTG CGATGCAACT CATCGCTGGC GCCGAAGGCC GTCGTCGTGG CAGTTACGGT GGCGCAGTCG GGTACTTTAC CGCTCATGGC GATCTGGATA CCTGCATCGT GATCCGCTCG GCCTACGTTG AAGACGGCAT TGCCACCGTC CAGGCAGGTG CTGGCATCGT TCTCGATTCT GTTCCGCAAT CTGAAGCTGA CGAAACTCGC AGTAAAGCTC GCGCGGTCTT GCGCGCTATC GCCACCGCAC ACCACGCACA GGAGATTTTC TGA
|
Protein sequence | MQTAKTKLEL LTCEAIYRHN PTALFQQVCG ARPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QALSENGASL LPLLDAALPS GIENEKRPQG RTLHFPAVSQ LLDEDARLCS LSVFDAFRLL QNLVDVPEDE REAMFFGGLF AYDLVAGFEN LPETEQGNRC PDYCFYLAET LMVIDHQKKY TRIQASLFTP SAAEKQRLAQ RIEQLQQQMT EEPTALPVQS IEHMQCEVSQ TDDQYGAVVR QMQKEIRAGE IFQVVPSRRF SLPCPSPLAA YDVLKKSNPS PYMFFMQDNE FTLFGASPES SLKFDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG TRYVADLTKV DRYSFVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAG AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS AYVEDGIATV QAGAGIVLDS VPQSEADETR SKARAVLRAI ATAHHAQEIF
|
| |
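
The Sequence rows can be cross-checked against the Translation table and GC content fields with a short script. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two tables assign the same amino acids and differ only in permitted start codons), ignores the spaces that group the sequence into blocks of ten, and stops at the first in-frame stop codon. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order; identical for NCBI tables 1 and 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def _clean(seq: str) -> str:
    """Drop whitespace (the record groups bases in blocks of ten) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = _clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = _clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the Gene sequence row.
    prefix = "ATGCAAACCG CCAAAACAAA ACTTGAGTTG"
    print(translate(prefix))
```

Passing the full Gene sequence value to `translate` and removing the spaces from the Protein sequence value allows a direct string comparison, and `gc_fraction` on the same input can be compared with the listed GC content after rounding.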