Gene Teth514_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1869 
Symbol 
ID5877358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1883557 
End bp1884984 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content35% 
IMG OID641542221 
Productanthranilate synthase component I 
Protein accessionYP_001663485 
Protein GI167040500 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTAAACA TTTCTAAAGA AGATTTTTGC GAACACAAAA AAAGAGGATA TGTCTTCCCA 
GTCTATGCAG AAATAAACGG AGATGAACTG ACACCAATAA ACATTTTTTA TAGTCTTAAA
GGCAAAAACA AATTTTTACT GGAAAGTGCA AATGGAGGCA CTAACTGGGG CAGGTACTCC
TTTATAGGAA AAGACCCTTA TTTATCAATT TTAAGTTATG GAAAAAGGAT AAAACTTATA
GGTGAAAGTG AAGAAGAAAA AGAAGGCTTA ATTCTTGATG AAATAAGAGA TATTATGAAT
TTTAAGTACA ATTCCTTAGG GCTTGATATC CCGTTTGTAG GTGGGGCTAT AGGTTATGCT
TCCTATGATC TGATTAGACT TTATGAGAAG CTACCAGATA AAAACCCTGA TGAAATAAAT
ATACCGGACG TATACTTTAT GTTTTATAAA AGTTTTATTT GCTATGACCA TCTTAAACAC
AGAATCTATG TTGTTTATAA TGTGTATCCT GAGGAAGACG TAGAATATGA AGAAGTTTTG
CAAAAAATTA ATGAACTTTT GCAAGAGGTA AAATCAAATG CTCCTCAATT TCATGACCTT
CCATCACAAC AAGAAAAGGA AATTTATTAC AATTTTACAA AAGAAGAGTT TTGTAAAATT
GTTGAAAAAG CAAAAGAATA CATCGAAAAA GGAGACATAT TCCAAGTAGT GTTGTCTCAA
AGGTTAAAAG CAGCAGTAAG CTCCCACCCT TTCGAGATAT ACAGAAGATT AAGGTCAAAA
AATCCATCTC CATATCTTTT TTACATCGAT TTTGGTGATT TTCAGCTTCT TGGTTCTTCA
CCTGAAAGCC TTGTAAGTGT TTTTGGAGAC AAAGTGACTA CAAATCCCAT TGCAGGCACA
AGGCGAAGAG GAAAAGATGA AGAAGAAGAT TTAAGACTTA AAGAGGAACT TTTAAAAGAT
GAAAAGGAAA GGGCAGAGCA TGTGATGTTA GTTGACCTTG GAAGAAACGA CATAGGAAAA
GTTAGTGAAT TTGGAAGTGT AAAAATAGAG CGTTTTATGG AAGTAGATTT TTACTCTCAT
GTAATGCATA TTGTATCGAC TGTTTCAGGA AAGTTAAAAA GAGGACTTAC GGCTTTTGAT
GCTCTTATAG CTTGTCTTCC TGCAGGTACA GTTTCTGGGG CACCAAAAAT AAGAGCGATG
GAAATAATAG ACGAACTTGA AAATGTGAGA AGGTCTTTTT ACGCAGGAGC TGTTGGATAT
TTTTCCTACA ATGGCAATAT GGACATGTGC ATAGCAATAA GGACTATTCT CTTCAAAGAA
GGTTATGCTT ACGTTCAAGC GGGAGCAGGC ATTGTATATG ATTCAATTCC TGAGATGGAA
TACTGTGAAA CTTTAAATAA GGCAATGGCT CTTAAGGAGG TTCTTTGA
 
Protein sequence
MVNISKEDFC EHKKRGYVFP VYAEINGDEL TPINIFYSLK GKNKFLLESA NGGTNWGRYS 
FIGKDPYLSI LSYGKRIKLI GESEEEKEGL ILDEIRDIMN FKYNSLGLDI PFVGGAIGYA
SYDLIRLYEK LPDKNPDEIN IPDVYFMFYK SFICYDHLKH RIYVVYNVYP EEDVEYEEVL
QKINELLQEV KSNAPQFHDL PSQQEKEIYY NFTKEEFCKI VEKAKEYIEK GDIFQVVLSQ
RLKAAVSSHP FEIYRRLRSK NPSPYLFYID FGDFQLLGSS PESLVSVFGD KVTTNPIAGT
RRRGKDEEED LRLKEELLKD EKERAEHVML VDLGRNDIGK VSEFGSVKIE RFMEVDFYSH
VMHIVSTVSG KLKRGLTAFD ALIACLPAGT VSGAPKIRAM EIIDELENVR RSFYAGAVGY
FSYNGNMDMC IAIRTILFKE GYAYVQAGAG IVYDSIPEME YCETLNKAMA LKEVL