Gene Information |
Locus tag | Teth514_1796 |
Symbol | |
ID | 5877315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 1805818 |
End bp | 1807668 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641542146 |
Product | extracellular solute-binding protein |
Protein accession | YP_001663417 |
Protein GI | 167040432 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
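
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short script. This is a minimal sketch, not part of the source record: the function names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue; the final codon is the stop and codes for nothing.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1805818, 1807668)
print(length, protein_length_aa(length))  # prints: 1851 616
```

Under those assumptions the computed values agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields of this record.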
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000801458 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AAATTAAAAA ATTTATTGCA CTGCTTGTAA TTGTTTTCTT TACAGCAGGT ATTTTTGTGG GGTGTGGGCA GTCAAATTCT ACTCAACAGG AATCCCAAGA AGCAAAACAA CAAGAAGAGG TAAAGAAGAC AAGAGAACTT AGACTTGCGA CGGATTGGCC CTATCCTTTC CACGGTAACC CATTTGGCCC AGGAGGTATA GGAGGAGCAT GGTGGTTTGC TTATGAGCCT TTTGCCTATT ATATTCCTCA AACAGGGGAA TACATTCCGC GCTTAGCTGA AAGTTGGAAA GTAGAAGGAA ATAAGGTTAC AGTAAATCTT AGGAAAGATG CAAAGTTCAG CGATGGCGAA CCTTTTACCT CAAAAGATGT GATAAATACG GTGAATTTAA TACAGGCTAT GTGGCAATGG CCTTATGACA TTGAATCTGT AGAGGCTCCA GATGACAATA CAGTTATTTT TACCCTTTCA AAGACAGCTT CCTCTTCTTT TGTACACACA CTGCTTACAG ATGGAGCTAT GGCATCCTTA GCGCCTGTAC ATGTATATGG GGATTTTGCT AAATCTGCTC AGGAAGTGGC TGATTTAGGG AAAAAGATAT TTTACTTACA AACAGAAGGC AAAACTGTGC CAGAGGATAT GAAAGCAGAG TATGATAAAA AATCAGACGA ATTAAGAAAA CAAGTGAATG ATTTTTCACC CTTTAAGACA TTAGGAAAGC TACCGGTAGT TGGTGCTTTT GAACCTGTTA AAGTAACTCA ATCAGAAATG GTATTGGAGG CAAATAAATA CTATTGGGCT TATCCGCAAA TGAAAATTGA CAAAGTTGTG TTTAAGAAAT GGTCTTCAAA TGAATTTGTA TGGGCTTCAC TTATATCTAA TGAAATTGAT GCAGCTCATC CTTCTATGCC AAAAGATGTA GTAGAACAAC TTTCAACATT GAATCCAAAA TTAAATGTGC TTACTGTTTC TGACTTGTCC GACATAGCAT TAGTATTTAA TTTTAAGAAA CCGCTCTTTC AAGATCTGAA CTTAAGAAAA GCTATTGCTC ATATATTAGA TAGAGATAAA ATTAGGGATG TCTCTGTATG GCAAGCAAAT AGTTATGAAA ATTACGCTGA CGGTGTATTA AAGAGTATGG AAGCAAAATG GGTAACTCAA GATACATTGC AAAAACTTAC AAAATATAAC ACGGATGTGG CAGCAGCAGA GGAGATTTTA AAGAATGCAG GCTACAAAAA AGTAGGAGAT ACCTGGCAAC AACCTAACGG ACAACCAGTG GCTTTCACTT TATCTGTATA CGGACCTCAT AACGATTGGG TATTGGCTGC AAAGGAAGTA GTTCAACAAT TGAACAATTT TGGATTTAAA GTTGAAATGA AATTGATTCC TGAAGGTATG AGAGACCAAG TAATGAGAAG TGGAGATTAC GATGTAGCTA TTGAATTTGG TTCTGCATGG TGGGGTTATC CTCATCCTTT GACTGGGTAT CAGAGGTTGT ATGATGGAGA CGTTTCTGCT ATTACTAATT TCCCTGCAAA AGACAAATAT CAAACTCCAT GGGGAGAACT TTCCCCCTAT GATTTGACAC TTGAATTGCA GAAGAACCTG CAGGATGAGA ACAAGGCAAT GGAAATAATT CAGCAATTAG CCTATATTAC TAATGAGTAT TTGCCTGTGA TACCGCTATA TGAGAAAGTA CTGCCCATTT ATTACAATGA TGGTTATAGA GTTAAAGGAT GGCCTGCAGA AGATGATGCT ATATGGTCTT TAGCACCAGG TGGAATTGAA 
AGGGTATACG ATTTATTGAT TACTACAGGT AAATTAGTTC CAGCAAAATA A
|
Protein sequence | MSKKIKKFIA LLVIVFFTAG IFVGCGQSNS TQQESQEAKQ QEEVKKTREL RLATDWPYPF HGNPFGPGGI GGAWWFAYEP FAYYIPQTGE YIPRLAESWK VEGNKVTVNL RKDAKFSDGE PFTSKDVINT VNLIQAMWQW PYDIESVEAP DDNTVIFTLS KTASSSFVHT LLTDGAMASL APVHVYGDFA KSAQEVADLG KKIFYLQTEG KTVPEDMKAE YDKKSDELRK QVNDFSPFKT LGKLPVVGAF EPVKVTQSEM VLEANKYYWA YPQMKIDKVV FKKWSSNEFV WASLISNEID AAHPSMPKDV VEQLSTLNPK LNVLTVSDLS DIALVFNFKK PLFQDLNLRK AIAHILDRDK IRDVSVWQAN SYENYADGVL KSMEAKWVTQ DTLQKLTKYN TDVAAAEEIL KNAGYKKVGD TWQQPNGQPV AFTLSVYGPH NDWVLAAKEV VQQLNNFGFK VEMKLIPEGM RDQVMRSGDY DVAIEFGSAW WGYPHPLTGY QRLYDGDVSA ITNFPAKDKY QTPWGELSPY DLTLELQKNL QDENKAMEII QQLAYITNEY LPVIPLYEKV LPIYYNDGYR VKGWPAEDDA IWSLAPGGIE RVYDLLITTG KLVPAK
|
| |