Gene Information |
Locus tag | Tter_1114 |
Symbol | |
ID | 8645609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013525 |
Strand | - |
Start bp | 1202808 |
End bp | 1205999 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | 3D domain protein |
Protein accession | YP_003322852 |
Protein GI | 269926229 |
COG category | |
COG ID | |
TIGRFAM ID | |
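The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive on both ends and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1202808
end_bp = 1205999

# Assumption: coordinates are 1-based and inclusive on both ends,
# so the span covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not encode an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the reported Gene Length (3192 bp) and Protein Length (1063 aa).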
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGCAG TAATGAGAAC CGTTATTGTA TTGCTTCTTT TATTGGCATT GGTAATGCCT ATATCTGCAA AGATGGCTCC TACAGCGCTA GCCGAAGAAT TAGGTGGAAA TGGTAGCTGG TATAGATACC AGGGGGAACC ATCTAGCTAT GACAGTCACA GCAATCAAAA CGTCAACTGT GGTCCCACCT CAGTAGCAAT GGCCATTCAA TATACCCAGA ACTTGTCAGT ACCTATCAGG GACATAAGGG ACTTTATAGG CAAGAATAAA AAATCCACTA ACCTCTCCGA TCTTACTAAG GCGCTCAGTC ACTGGAATGT TGCCTATAGG GCAGACATAT ATAACGTAAG CGATATCAAA GCAGCCCTTC AGCAGGGACA CATTATCATA GTAGCGTTGG ACATGCGCGC TATTAGCCCT GGAGCAGATC TAAATGGAGC TTCAGCTGAT CCAAGTATTA GGATAGGCAG ATTCGAATCA ATAGCCAGAG AACATTGGAT AGTGATAAAG GGCATAACTC CAGATGAGAA TTACTTTATA GTCTACGATG GCAACGTATG GGGAGGTCCT GGTAATCCAG TCTATTGGTA CAGCGATGGC ACACCCAAGG GCATGGACAG GTACTACGCC GTGAACGAAG TTGAAAAGGG CATGCGACAT TTTGGAGCTA ATACAATCAA GGGCATAGAA ATAATATCTT CTCGATTACC AGACCGAACC CTCAGAAACA CAGTAATCGC CAAAGTAACT TACTACACCC TGAGTCCGGA GGAGACTGGC AAGAATCCTG GGGATCCTGG ATGGGGGATA ATGCGCAATG GCAAAAAGGT TCATTGGGGT GCTGTAGCCG TAGACCCCAA CTACATTCCC CTGGGGACCA AGATGCTCAT AGATGGTTGG GAAGATCAAA TATTTGTAGC ATCTGATACA GGCTCGCAGG TAAAGGGATG GCAAATAGAT GTTTACTGGC CTGGTAGTAG GGAAGAAGCT CTCAGAAAGA ATGATGAACT GGGCGGGTGG CGTAAGATCA CCTTCATAGG TATGGATGCC CCAGTAAGTA CCTCTACTAC AACTCCCCCT GTAAATGCTT ACATCAATGC GCCCGAAACG ACTACTACAC GATGGATAAA TCTACAGTTA CATGCTGAGG ATCCTGAAGA AGGTGTAGTA GGCATGATGA TATCAAATAG CAAGAACTTT ACAGATGCCT TCGAGGAGCC TTACTCATCA ATTAAGGAAT GGACATTGCC TCCTGGAGAC GGTGAAAAAA CGGTATACGC TCGATTCAAA AACTCTGACG GGGCTTGGAG TAGCCTCGTA GAGGCACATA TACACCTGGA AGAAGAACCC CCTACCGGTG CCGTAACTTT GGCTCCTAAG CCAGGATTGG TGTCCTACAT TCCATTTAAC GGGAGTACCA TGGCAGTTAT AGGACCTCAA CCAAAGATTT CAGGTCCTGT AAGGTATGCA TCCATGAACA ATAACAAGCT AAGTAACTCA AGCTTTGAGC TATGGGCAGG GGGGATACCT AAAGATTGGG ACTCTCCACT GAGAGATGAG TCGTATGCTG CGTATGAGCC CAGTACGGAA GCGTTGGATG GCTCAACATC TCTATTTTCA AACTCAACCA AAGAAGGCTA TATTTACCAG ATAGTACCTG TAAGAGCCAA TACCAGTTAC ACACTAGCGA TAATGGCGAA AGGCAATAAT GGAGCCATTC AAATCCAAGA ATTGCAGACC ACGGGCACAA CAAGCCGTGT TCTCAAATCG CATACCGCTG GATTCAAGTA CGCATCAGAT 
TGGAAAGAGA TTAAGATCAA ATTTAGGACG CAGGCCAATA CCACAAACGC CTTAGTCAAG TTATGGGGAA AGAACGCCTA CTGGGACGAC ATACAGCTTG TAGAGGGGTT TAACCCAACT AACTACCTAT CAGAAGGGCT ACTGCTAGAG GGATCATCTA GAAACTACAT AGAGAACCCT AGCGGAGAGT TAGGTCCAGA GGGTTGGAGC GGTATAAATT CCTGGGTGGA TATAACCAGT ACTAGAGAAT ATACCTTCTT TGGTTACAAA GCATTACGCG TGCGTAAGGT CAAGCCTGGC CCTGCAGCCA CTATAATCGC AGCAAATCTT ATCCCGGGCA AAACTTACAC ACTCTCAGCA TACATAAAGC TAGAAGATCA AAGACCCGTA GATAGTAGCA TCATACGAGG ATGGTTTTTT GAAGGCATAG ATAGTGTGGA TCAGCTGGAC CTGTCTCAGG TAACAGATAT GAATAGACCT GTAATGCAAT GGGAGCCCGT AGGGGGAGGA TGGTATCGAG GCTACTTCAC ATTCATAGCC AGGCACGATA AGGGACTATA TGGTGTTATG TCTACAGATC GTATTCCAGT CGGTGGCATC TACTACATGG ATGGTGTGCA GCTGGAAGAG AGCGATAATC CATCAACGTA CCTTGATGGC AACTCTGGAT CTGGTTATAA ATGGTCTGGG AAACCTTATC GCAGCACTTC CTATAGAGAA GGCACAACAG TAATAGTTGG TGAGCTCAGG AACTCTACTG GAACTGTATT TTTCAAAGCC AGATCATTAG ACAGCCATAA AACTACAAGC AATGCGACGA TCCTAAAGAT AGGGCAGTTG TACATACAGC AAAGAAGCAA TCAAATGCTC TTCAAATGGG GAGATAAGCT CATCGGATCA GCATCGTTAG ATACCCATCC CAATGCTTAT GCAGTAACGT GGAATTCCCT GAGGATAACC GTGTATTCTA ATGGGAAAGA GATAGGCAGC GTGGCTGCCC GTGGCCCCCG AAAAGGTAGA TTAGTAGAGA TATCGCCAAG TCAAGATACC AAGGCAATAG TCATTTCAGA ATTTAGCCTA TGGAAAAGCG TGCTCACCGA AGAGAACATA AGCTTTCTGA GCTCGCACAG GAGTATAGAT CCAGGAGTAA GGTTCACAGT CGATCCTAAG ACAAGAATAT GGGTTGCCGC TCAGGATCCA ACGAATAAAG ACCTTAGAAT TCTATGGAGT CCGGACGGAA TCCACTGGCA GAGCTGGAAC AAAGGGATTG GTTCTTCGCC TTGGAATATA GGCGGATCGA AAGGGCTCAA AACTGTATGG ATAAAAGTCA TAGATCCCAT AGGAAACTGG ATGATGTACA AAGATGAAAT ATATCTAGGA AAGGAAGGTT GA
|
Protein sequence | MHAVMRTVIV LLLLLALVMP ISAKMAPTAL AEELGGNGSW YRYQGEPSSY DSHSNQNVNC GPTSVAMAIQ YTQNLSVPIR DIRDFIGKNK KSTNLSDLTK ALSHWNVAYR ADIYNVSDIK AALQQGHIII VALDMRAISP GADLNGASAD PSIRIGRFES IAREHWIVIK GITPDENYFI VYDGNVWGGP GNPVYWYSDG TPKGMDRYYA VNEVEKGMRH FGANTIKGIE IISSRLPDRT LRNTVIAKVT YYTLSPEETG KNPGDPGWGI MRNGKKVHWG AVAVDPNYIP LGTKMLIDGW EDQIFVASDT GSQVKGWQID VYWPGSREEA LRKNDELGGW RKITFIGMDA PVSTSTTTPP VNAYINAPET TTTRWINLQL HAEDPEEGVV GMMISNSKNF TDAFEEPYSS IKEWTLPPGD GEKTVYARFK NSDGAWSSLV EAHIHLEEEP PTGAVTLAPK PGLVSYIPFN GSTMAVIGPQ PKISGPVRYA SMNNNKLSNS SFELWAGGIP KDWDSPLRDE SYAAYEPSTE ALDGSTSLFS NSTKEGYIYQ IVPVRANTSY TLAIMAKGNN GAIQIQELQT TGTTSRVLKS HTAGFKYASD WKEIKIKFRT QANTTNALVK LWGKNAYWDD IQLVEGFNPT NYLSEGLLLE GSSRNYIENP SGELGPEGWS GINSWVDITS TREYTFFGYK ALRVRKVKPG PAATIIAANL IPGKTYTLSA YIKLEDQRPV DSSIIRGWFF EGIDSVDQLD LSQVTDMNRP VMQWEPVGGG WYRGYFTFIA RHDKGLYGVM STDRIPVGGI YYMDGVQLEE SDNPSTYLDG NSGSGYKWSG KPYRSTSYRE GTTVIVGELR NSTGTVFFKA RSLDSHKTTS NATILKIGQL YIQQRSNQML FKWGDKLIGS ASLDTHPNAY AVTWNSLRIT VYSNGKEIGS VAARGPRKGR LVEISPSQDT KAIVISEFSL WKSVLTEENI SFLSSHRSID PGVRFTVDPK TRIWVAAQDP TNKDLRILWS PDGIHWQSWN KGIGSSPWNI GGSKGLKTVW IKVIDPIGNW MMYKDEIYLG KEG
|
| |
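The sequences above are displayed in space-separated blocks of 10 characters. Before computing statistics such as length or GC content, the blocks need to be joined. The sketch below shows one way to do this; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above.
example = clean_sequence("ATGCACGCAG TAATGAGAAC CGTTATTGTA")

print(len(example))          # number of bases after joining the blocks
print(example[:3])           # first codon of the excerpt
print(gc_fraction(example))  # GC fraction of this excerpt only
```

Applying the same two helpers to the complete gene sequence would allow the reported length (3192 bp) and GC content (45%) to be compared against the sequence itself.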