Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Taci_0008 |
Symbol | |
ID | 8629818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermanaerovibrio acidaminovorans DSM 6589 |
Kingdom | Bacteria |
Replicon accession | NC_013522 |
Strand | + |
Start bp | 8644 |
End bp | 10278 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003316531 |
Protein GI | 269791627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00260142 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAAAA AGGCGGTCCT GCTCCTGTTG GCACTGGGCG TCATGTCCCT TATGTCCCTT CCGGCCCTGG CGGCGGATAA GCCGGTCTAC GGGGGAACCC TGGTCTACCG GGAAGCGTCG GATCCCCCCA AGATAGACCC GGCGTTCACC ACTGACACCA CCTCCGATCG GGCGTCCAAC CTGATATTCG AGACGTTGGT GATCAACTCC CCGGACGGCA AGAGGATCCT TCCCTGCCTT GCGGAGTCCT GGACCATAAA CAAGGACTCC ACGGTGTTCA CCTTCAAGCT CCGCAAGGGG GTCAAGTTCC ACAGCGTGAG CGAGGGGAAG CCAACGCTGA ACAAGGGCCG GGAGGTGAAG GCGGAGGACG TGAAGTACTC CTTCGAGCGT CTGGTCCGTC TCAAGAGCCC CCGGGCCTAC TTCGTGGAGC AGATAAAGGG CTACAAGGAG TTCACCGAGG GTAAGGCCAA GGAGTGGACC GGGATAAAGG TTCTGGACCC CTACACGGTG CAGTTCACCC TGGAGTACCC CTTCGCGCCG TTCCTTGCGG TGCTGGCGTA CCACGCGTTC AGCGTGGTTC CCCGGGAGGA CGCGGAGAAG TGGGGCAAGG ACTTCTCGTT CCACCCGGTG GGCACCGGTC CGTTCGTGTT CAAGGAGTGG AAGCACGACC AGCGTTTCGT GGTGGAGCGG AACCCCCATT ACTGGGGCAA GGATGCCCAG GGGAACAGGC TTCCCTACGT GGACAGGGTG GAGATCCGGA TCGTCCCGGA CAACTCGGTG GCGTGGCTGG AGTTCAAAAA GGGCAACATC GACATCCTGT CCGCCATCCC CAACGAGTAC TACAAGGAGT GCAAGGGCCT CTACGGTCCC AAGAAGCTGT TCGTGGAGCG TCCCGGGGTG GGCACCTTCT ACATCGGCAT GAACACCTCC AAGCCCCCCT TCAAGGACAA CGTGAAGCTC CGCCAGGCGC TGAACTGGGC CATCGACCGG CAGGCCATCT CGGACCTGAT CCTGAGCGGC CGGAACAGGC CAGTTAAGGG GGTCCTGCCC CCCAGCATGC CGGGCTTCAA CCCCAACCTG AAGGGCTACG GCTATGACCC CGCCAAGGCG AAGAAGCTCC TGGCGGAGGC GGGCTACCCC AAGGGGCTCA CAGTGGAGTT CCAGTTCAAC TCCGGGGGTA GCAACCGGCA GATAGCCGAG GCGGTCCAGG CCCAGCTATC CCAGATAGGG GTCAACGTGA AGCTGAAGGA GCTGGACTGG GGGGCGCACC TGGACATGTG CGACCGGGGT GAGACCCAGA TGTACCGGAT GACCTGGGTG GTGGACTACA TGGACCCGGA CAACTTCCTG TTCGTGAACC TCCACTCCTC CAATGCGGGC TCCAAGGGGA ACTACTCCTT CTACAAGAAC CCCAAGGTGG ACCGGCTCCT GTCCGAGGCC CGGCGGGAGT CCAACTGGAA CAAGCGGATG AAGCTCTACC AGGAGGCGGA GCAGCTGATC GTCAACGATG CCCCCTGGAT CTTCATGATG GCCACCAGCA GCAGCATGGT GCACCAGCCC AACGTGAAGA ACGTTGTCCT CCACGCCATG GGGGACTACA TGACGGACCT CTCCAGGGTC TGGAAGGTCA AGTGA
|
Protein sequence | MRKKAVLLLL ALGVMSLMSL PALAADKPVY GGTLVYREAS DPPKIDPAFT TDTTSDRASN LIFETLVINS PDGKRILPCL AESWTINKDS TVFTFKLRKG VKFHSVSEGK PTLNKGREVK AEDVKYSFER LVRLKSPRAY FVEQIKGYKE FTEGKAKEWT GIKVLDPYTV QFTLEYPFAP FLAVLAYHAF SVVPREDAEK WGKDFSFHPV GTGPFVFKEW KHDQRFVVER NPHYWGKDAQ GNRLPYVDRV EIRIVPDNSV AWLEFKKGNI DILSAIPNEY YKECKGLYGP KKLFVERPGV GTFYIGMNTS KPPFKDNVKL RQALNWAIDR QAISDLILSG RNRPVKGVLP PSMPGFNPNL KGYGYDPAKA KKLLAEAGYP KGLTVEFQFN SGGSNRQIAE AVQAQLSQIG VNVKLKELDW GAHLDMCDRG ETQMYRMTWV VDYMDPDNFL FVNLHSSNAG SKGNYSFYKN PKVDRLLSEA RRESNWNKRM KLYQEAEQLI VNDAPWIFMM ATSSSMVHQP NVKNVVLHAM GDYMTDLSRV WKVK
|
| |