Gene Ent638_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2204 
Symbol 
ID5112896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2394351 
End bp2395913 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content57% 
IMG OID640492391 
Productanthranilate synthase component I 
Protein accessionYP_001176930 
Protein GI146311856 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00459304 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACCG CCAAAACAAA ACTTGAGTTG CTGACCTGCG AAGCAATCTA TCGCCACAAC 
CCAACCGCCT TGTTTCAGCA AGTCTGCGGT GCACGCCCCG CCACGCTGCT GCTGGAATCC
GCCGATATCG ATAGCAAAGA CGATCTCAAA AGCCTGCTGC TGGTAGACAG CGCATTGCGG
ATCACGGCCT TAGGTGACAC TGTCACCATC CAGGCGTTAT CAGAAAACGG TGCGTCGCTG
CTGCCGCTGC TGGATGCCGC TCTGCCCTCT GGCATCGAAA ATGAAAAACG TCCGCAAGGC
CGCACACTGC ATTTCCCAGC GGTAAGCCAA CTGCTGGATG AAGACGCGCG TCTGTGTTCG
CTGTCCGTTT TTGATGCCTT CCGCTTACTG CAAAATCTGG TGGATGTACC TGAGGACGAG
CGCGAAGCGA TGTTCTTTGG CGGGCTGTTT GCCTATGATC TGGTTGCCGG ATTCGAAAAC
TTACCCGAAA CCGAGCAAGG CAACCGCTGC CCGGATTACT GCTTCTATCT GGCAGAAACC
CTGATGGTGA TTGACCATCA GAAGAAATAC ACCCGCATTC AGGCCAGCCT CTTTACGCCT
TCCGCTGCTG AAAAACAGCG CCTTGCACAG CGTATCGAAC AGCTGCAACA GCAGATGACG
GAAGAACCGA CTGCGCTACC GGTGCAAAGC ATCGAGCATA TGCAGTGTGA AGTGAGCCAG
ACGGACGATC AGTACGGCGC GGTTGTCCGC CAGATGCAAA AAGAAATTCG CGCAGGCGAG
ATTTTCCAGG TGGTGCCGTC GCGTCGCTTC TCACTCCCGT GCCCTTCTCC GCTGGCAGCG
TATGACGTGC TGAAGAAAAG CAATCCGAGC CCGTACATGT TCTTTATGCA GGACAACGAG
TTCACGCTGT TTGGCGCATC GCCTGAAAGC TCACTGAAAT TCGATGCGAC CAGTCGTCAG
ATTGAGATCT ACCCGATCGC CGGGACGCGT CCACGCGGTC GTCGCGCGGA TGGTTCACTG
GACCGCGATC TTGACAGCCG CATCGAGTTA GAAATGCGTA CCGACCACAA AGAGCTCTCC
GAGCACCTGA TGCTGGTTGA CCTGGCGCGT AACGATCTGG CGCGCATCTG TACGCCGGGC
ACCCGTTACG TCGCAGATTT AACCAAAGTT GACCGCTACT CCTTCGTGAT GCACCTCGTT
TCACGCGTTG TGGGCGAGCT ACGCCACGAT CTCGATGCGC TGCACGCCTA CCGCGCCTGC
ATGAATATGG GCACCCTGAG CGGCGCGCCA AAAGTGCGTG CGATGCAACT CATCGCTGGC
GCCGAAGGCC GTCGTCGTGG CAGTTACGGT GGCGCAGTCG GGTACTTTAC CGCTCATGGC
GATCTGGATA CCTGCATCGT GATCCGCTCG GCCTACGTTG AAGACGGCAT TGCCACCGTC
CAGGCAGGTG CTGGCATCGT TCTCGATTCT GTTCCGCAAT CTGAAGCTGA CGAAACTCGC
AGTAAAGCTC GCGCGGTCTT GCGCGCTATC GCCACCGCAC ACCACGCACA GGAGATTTTC
TGA
 
Protein sequence
MQTAKTKLEL LTCEAIYRHN PTALFQQVCG ARPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QALSENGASL LPLLDAALPS GIENEKRPQG RTLHFPAVSQ LLDEDARLCS
LSVFDAFRLL QNLVDVPEDE REAMFFGGLF AYDLVAGFEN LPETEQGNRC PDYCFYLAET
LMVIDHQKKY TRIQASLFTP SAAEKQRLAQ RIEQLQQQMT EEPTALPVQS IEHMQCEVSQ
TDDQYGAVVR QMQKEIRAGE IFQVVPSRRF SLPCPSPLAA YDVLKKSNPS PYMFFMQDNE
FTLFGASPES SLKFDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG TRYVADLTKV DRYSFVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAG AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS AYVEDGIATV
QAGAGIVLDS VPQSEADETR SKARAVLRAI ATAHHAQEIF