Gene STER_0809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0809 
Symbol 
ID4438157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp743309 
End bp745027 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content38% 
IMG OID639676498 
Productpara-aminobenzoate synthetase component I 
Protein accessionYP_820252 
Protein GI116627633 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAGA AAACCGTTAT TGATTTTAAA GAACTTGGCG TCAGACAAAT CTTCACTCAC 
GCCACAAAAG AGATAAAAAC CAAAGACATT AAGGAAGTTA AATCACTTAT AAATCAAATA
GAAGCCTATC AAGAAAAAGG CTATTTTGCT GTAGGCTATG TAGCCTATGA AGCTTCTCAG
GCCTTTGAAC CTAAATTTCA AATTTTTGAT AGCCCATTAA TGTCAGAGTA TCTTCTCTAT
TTTACTATTC ACGATACTGT TCAAACAGAG TCTATCCCTC TTGCTTATGA GCCTGTTCCC
TTACCAGAAT CTTGGCAAGA ACTAACTTCT GCAGAGGAAT ACAAGGCTGC TATTGAGCAT
ATACACCACC ATATTCGTCA AGGAAACACC TACCAGGTCA ATTTTACCGT CCAACTTCAA
CAGAACATAA CAGCTGATCC ATTTGCCATC TACAACCGAT TGGTTGTTGA GCAAAATGCA
CATTACAATG CCTTTATTCA ACATGATGAT GTCTCCATCA TTTCCATAAG TCCTGAACTC
TTCTTTAAAA AAGATGGTGA TATATTGACC ACACGTCCTA TGAAAGGGAC AACAAATCGT
GGCTTGACAA CTGAAACTGA CCTTAAACAA GCACAATGGC TTGCTCATGA TCAGAAAAAT
CGCTCTGAAA ATATGATGAT TGTAGATCTT CTTAGAAATG ACATGAATCG TATTTCAAAA
ATAGGGAGTG AAAATGTAAA AAGACTTTGC CAGGTTGAAC AATACTCTAC TGTTTGGCAA
ATGACTTCAA CTATTGAGAC GCAACTCCTA CCAAACAGTC GTTTGGATGA CATCTTCCAA
GCCCTTTTTC CTTGTGGATC TATTACAGGA GCACCAAAAA TAGCTACTAT GGCAATTATT
AAAAACGTCG AAAAACAAGC TCGAGGCGTC TATTGTGGAG CCATTGGTAT CTTGCTACCT
AATGGACCAA CTATTTTCAA CGTAGCCATC CGAACACTTC AAATGCAGGG AAACAATGCT
ATATATGGAG TAGGCGGTGG AATCACCTGG GACAGCAAAT GGGAAGCTGA ATATGAAGAA
ACAAGGCAAA AATCAGCTAT TCTATACCGT CAAAATCCTA GATTTGATCT TATCTCAACT
GGACGGATTC ATCAAGGTAA ACTACTCCAT CTTAAAGAAC ATCTCAATCG TCTACAAGAG
TCCAGTCGCT ATTTTGCTTA TCCTTTCAAT AAAAAAGAAG TTCAAAATCA AGTCGAAGAT
TTGTGTCAGT CCCTTGATTT TGACACAGAC TACCGTCTTA AATTGTCCCT TGCAAAAGAT
GGTAAACTTA CTTTTGAACA TGCTCAATTA ACAGAATTAG ACGATGATTT TTGTCAAGCA
AGATTAGTTA AGCAAACACA TCCTTTGAAT AACCCCTATA CCTACTTTAA AACAAGTTAT
CGACCACACA TTAGTCTAGG ACCTCATGAG CAAATCTACT ATAATCAAAA GAAAGAACTT
TTAGAAACTT CTATCGGTAA CCTCGTTCTT AAAATCAAGG ACCAACTCTA CACTCCACCT
GTTCACCTCG GTCTTTTAAA CGGTATTTAC AGACAAAGCC TCATTGCTAA TAATCAGGTC
ACAGAGAAAG TTTTGACTCT GGAAGATTTA AAACAGGCTC AAGCCATCTA TGGCTGTAAT
GCTGTGAGAG GGTTGTATGA ATTGAGGGTA GATTTCTAA
 
Protein sequence
MHKKTVIDFK ELGVRQIFTH ATKEIKTKDI KEVKSLINQI EAYQEKGYFA VGYVAYEASQ 
AFEPKFQIFD SPLMSEYLLY FTIHDTVQTE SIPLAYEPVP LPESWQELTS AEEYKAAIEH
IHHHIRQGNT YQVNFTVQLQ QNITADPFAI YNRLVVEQNA HYNAFIQHDD VSIISISPEL
FFKKDGDILT TRPMKGTTNR GLTTETDLKQ AQWLAHDQKN RSENMMIVDL LRNDMNRISK
IGSENVKRLC QVEQYSTVWQ MTSTIETQLL PNSRLDDIFQ ALFPCGSITG APKIATMAII
KNVEKQARGV YCGAIGILLP NGPTIFNVAI RTLQMQGNNA IYGVGGGITW DSKWEAEYEE
TRQKSAILYR QNPRFDLIST GRIHQGKLLH LKEHLNRLQE SSRYFAYPFN KKEVQNQVED
LCQSLDFDTD YRLKLSLAKD GKLTFEHAQL TELDDDFCQA RLVKQTHPLN NPYTYFKTSY
RPHISLGPHE QIYYNQKKEL LETSIGNLVL KIKDQLYTPP VHLGLLNGIY RQSLIANNQV
TEKVLTLEDL KQAQAIYGCN AVRGLYELRV DF