Gene EcSMS35_1376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1376 
SymbolpabB 
ID6144488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1363927 
End bp1365288 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content51% 
IMG OID641616254 
Productaminodeoxychorismate synthase 
Protein accessionYP_001743434 
Protein GI170679636 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.661176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.079748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGT TATCTCCCGC TGTGATTACT TTACCCTGGC GTCAGGACGC CGCTGAATTT 
TATTTCTCAC GCTTAAGCCA CCTGCCGTGG GCGATGCTTT TACACTCCGG CTATGCCGAT
CATCCGTATA GCCGCTTTGA TATTGTGGTC GCCGAGCCAA TTTGCACTTT AACTACTTTC
GGTAAAGAAA CCGTTATTAG TGAAAGCGAT AAACGTACAA CGACCACTGA TGACCCGCTA
CAGGTGCTCC AGCAGGTGCT GGATCGCGCA GACATTCACC CAACGCATAA CGAAGATTTG
CCATTTCAGG GCGGCGCGCT GGGGTTGTTT GGCTACGATC TGGGCCGCCG TTTTGAGTCA
CTGCCAGAAA TTGCGCAGCA AGATATCGTT CTGCCGGATA TGGCAGTGGG TATCTACGAC
TGGGCGCTGG TTGTTGACCA CCAGCGTCAA ACAGTTTCTT TGCTGAGTCA TAATGATGTC
AATGCTCGTC GGGCCTGGCT GGAAAGCCAG CAATTCTCGC CGCAGGAAGA TTTCACGCTC
ACTTCCGACT GGCAATCCAA TATGACCCGC GAACTGTACG GCGAAAAATT TCGCCAGGTA
CAGGAATATC TGCACAGCGG TGATTGCTAT CAGGTGAATC TCGCCCAGCG TTTTCATGCG
ACCTATTCTG GCGATGAATG GCAGGCATTC CTTCAGCTTA ATCAGGCCAA CCGCGCGCCG
TTTAGCGCTT TTTTACGCCT TGAACAAGGC ACGATTTTAA GCCTTTCGCC AGAGCGGTTT
ATCCTTTGTG ATAACAGTGA AATCCAGACC CGCCCGATTA AAGGCACGCT ACCACGCCTG
CCCGATCCTC AGGAAGATAG CAAACAAGCA GAGAAACTGG CGAACTCAGC GAAAGATCGT
GCCGAAAATC TGATGATTGT CGATTTAATG CGTAATGATA TCGGTCGTGT TGCCGTAGCA
GGTTCGGTAA AAGTACCAGA GCTGTTCGTG GTGGAACCCT TCCCTGCCGT GCATCATCTG
GTCAGCACCA TAACGGCGCA ACTACCAGAA CAGTTACACG CCAGCGATCT GCTGCGCGCG
GCTTTTCCTG GTGGCTCAAT AACTGGGGCT CCGAAAGTAC GGGCTATGGA AATTATCGAC
GAACTGGAAC CGCAGCGACG CAATGCCTGG TGCGGCAGCA TTGGCTATTT GAGCTTTTGC
GGCAACATGG ACACCAGCAT TACTATCCGC ACGCTGACTG CCATTAACGG ACAAATTTAC
TGCTCTGCGG GGGGTGGAAT TGTCGCCGAT AGCCAGGAAG AAGCGGAATA TCAGGAAACT
TTTGATAAAG TTAATAAGAT ATTACGCCAA CTGGAGAAGT AA
 
Protein sequence
MKTLSPAVIT LPWRQDAAEF YFSRLSHLPW AMLLHSGYAD HPYSRFDIVV AEPICTLTTF 
GKETVISESD KRTTTTDDPL QVLQQVLDRA DIHPTHNEDL PFQGGALGLF GYDLGRRFES
LPEIAQQDIV LPDMAVGIYD WALVVDHQRQ TVSLLSHNDV NARRAWLESQ QFSPQEDFTL
TSDWQSNMTR ELYGEKFRQV QEYLHSGDCY QVNLAQRFHA TYSGDEWQAF LQLNQANRAP
FSAFLRLEQG TILSLSPERF ILCDNSEIQT RPIKGTLPRL PDPQEDSKQA EKLANSAKDR
AENLMIVDLM RNDIGRVAVA GSVKVPELFV VEPFPAVHHL VSTITAQLPE QLHASDLLRA
AFPGGSITGA PKVRAMEIID ELEPQRRNAW CGSIGYLSFC GNMDTSITIR TLTAINGQIY
CSAGGGIVAD SQEEAEYQET FDKVNKILRQ LEK