Gene Teth514_1796 details


Gene Information

Locus tag: Teth514_1796
Symbol:
ID: 5877315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1805818
End bp: 1807668
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 37%
IMG OID: 641542146
Product: extracellular solute-binding protein
Protein accession: YP_001663417
Protein GI: 167040432
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
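
The start, end, gene length, and protein length fields are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1805818
end_bp = 1807668
gene_length_bp = 1851
protein_length_aa = 616


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: 1807668 - 1805818 + 1 = 1851 bp, and 1851 / 3 - 1 = 616 aa.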


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000801458
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AAATTAAAAA ATTTATTGCA CTGCTTGTAA TTGTTTTCTT TACAGCAGGT 
ATTTTTGTGG GGTGTGGGCA GTCAAATTCT ACTCAACAGG AATCCCAAGA AGCAAAACAA
CAAGAAGAGG TAAAGAAGAC AAGAGAACTT AGACTTGCGA CGGATTGGCC CTATCCTTTC
CACGGTAACC CATTTGGCCC AGGAGGTATA GGAGGAGCAT GGTGGTTTGC TTATGAGCCT
TTTGCCTATT ATATTCCTCA AACAGGGGAA TACATTCCGC GCTTAGCTGA AAGTTGGAAA
GTAGAAGGAA ATAAGGTTAC AGTAAATCTT AGGAAAGATG CAAAGTTCAG CGATGGCGAA
CCTTTTACCT CAAAAGATGT GATAAATACG GTGAATTTAA TACAGGCTAT GTGGCAATGG
CCTTATGACA TTGAATCTGT AGAGGCTCCA GATGACAATA CAGTTATTTT TACCCTTTCA
AAGACAGCTT CCTCTTCTTT TGTACACACA CTGCTTACAG ATGGAGCTAT GGCATCCTTA
GCGCCTGTAC ATGTATATGG GGATTTTGCT AAATCTGCTC AGGAAGTGGC TGATTTAGGG
AAAAAGATAT TTTACTTACA AACAGAAGGC AAAACTGTGC CAGAGGATAT GAAAGCAGAG
TATGATAAAA AATCAGACGA ATTAAGAAAA CAAGTGAATG ATTTTTCACC CTTTAAGACA
TTAGGAAAGC TACCGGTAGT TGGTGCTTTT GAACCTGTTA AAGTAACTCA ATCAGAAATG
GTATTGGAGG CAAATAAATA CTATTGGGCT TATCCGCAAA TGAAAATTGA CAAAGTTGTG
TTTAAGAAAT GGTCTTCAAA TGAATTTGTA TGGGCTTCAC TTATATCTAA TGAAATTGAT
GCAGCTCATC CTTCTATGCC AAAAGATGTA GTAGAACAAC TTTCAACATT GAATCCAAAA
TTAAATGTGC TTACTGTTTC TGACTTGTCC GACATAGCAT TAGTATTTAA TTTTAAGAAA
CCGCTCTTTC AAGATCTGAA CTTAAGAAAA GCTATTGCTC ATATATTAGA TAGAGATAAA
ATTAGGGATG TCTCTGTATG GCAAGCAAAT AGTTATGAAA ATTACGCTGA CGGTGTATTA
AAGAGTATGG AAGCAAAATG GGTAACTCAA GATACATTGC AAAAACTTAC AAAATATAAC
ACGGATGTGG CAGCAGCAGA GGAGATTTTA AAGAATGCAG GCTACAAAAA AGTAGGAGAT
ACCTGGCAAC AACCTAACGG ACAACCAGTG GCTTTCACTT TATCTGTATA CGGACCTCAT
AACGATTGGG TATTGGCTGC AAAGGAAGTA GTTCAACAAT TGAACAATTT TGGATTTAAA
GTTGAAATGA AATTGATTCC TGAAGGTATG AGAGACCAAG TAATGAGAAG TGGAGATTAC
GATGTAGCTA TTGAATTTGG TTCTGCATGG TGGGGTTATC CTCATCCTTT GACTGGGTAT
CAGAGGTTGT ATGATGGAGA CGTTTCTGCT ATTACTAATT TCCCTGCAAA AGACAAATAT
CAAACTCCAT GGGGAGAACT TTCCCCCTAT GATTTGACAC TTGAATTGCA GAAGAACCTG
CAGGATGAGA ACAAGGCAAT GGAAATAATT CAGCAATTAG CCTATATTAC TAATGAGTAT
TTGCCTGTGA TACCGCTATA TGAGAAAGTA CTGCCCATTT ATTACAATGA TGGTTATAGA
GTTAAAGGAT GGCCTGCAGA AGATGATGCT ATATGGTCTT TAGCACCAGG TGGAATTGAA
AGGGTATACG ATTTATTGAT TACTACAGGT AAATTAGTTC CAGCAAAATA A
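
The GC content field can be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is given in the space-separated 10-base blocks used above. The helper name `gc_percent` is hypothetical, and only the first row of the sequence is used as sample input, so the printed figure is for that row and not for the whole gene.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First row of the gene sequence from this record (sample input only).
first_row = "ATGAGTAAAA AAATTAAAAA ATTTATTGCA CTGCTTGTAA TTGTTTTCTT TACAGCAGGT"
print(round(gc_percent(first_row), 1))
```

Passing the full 1851-base sequence to the same function gives a value to compare against the GC content field.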
 
Protein sequence
MSKKIKKFIA LLVIVFFTAG IFVGCGQSNS TQQESQEAKQ QEEVKKTREL RLATDWPYPF 
HGNPFGPGGI GGAWWFAYEP FAYYIPQTGE YIPRLAESWK VEGNKVTVNL RKDAKFSDGE
PFTSKDVINT VNLIQAMWQW PYDIESVEAP DDNTVIFTLS KTASSSFVHT LLTDGAMASL
APVHVYGDFA KSAQEVADLG KKIFYLQTEG KTVPEDMKAE YDKKSDELRK QVNDFSPFKT
LGKLPVVGAF EPVKVTQSEM VLEANKYYWA YPQMKIDKVV FKKWSSNEFV WASLISNEID
AAHPSMPKDV VEQLSTLNPK LNVLTVSDLS DIALVFNFKK PLFQDLNLRK AIAHILDRDK
IRDVSVWQAN SYENYADGVL KSMEAKWVTQ DTLQKLTKYN TDVAAAEEIL KNAGYKKVGD
TWQQPNGQPV AFTLSVYGPH NDWVLAAKEV VQQLNNFGFK VEMKLIPEGM RDQVMRSGDY
DVAIEFGSAW WGYPHPLTGY QRLYDGDVSA ITNFPAKDKY QTPWGELSPY DLTLELQKNL
QDENKAMEII QQLAYITNEY LPVIPLYEKV LPIYYNDGYR VKGWPAEDDA IWSLAPGGIE
RVYDLLITTG KLVPAK
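
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 30 bases, assuming the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code for sense codons. The compact table construction and the name `translate` are illustrative choices, not the database's own method.

```python
from itertools import product

# Amino acids for codons ordered by base T, C, A, G at each position
# (NCBI translation table 11; '*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from this record.
print(translate("ATGAGTAAAA AAATTAAAAA ATTTATTGCA"))  # MSKKIKKFIA
```

The output matches the first block of the protein sequence above.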