Gene Information |
Locus tag | Tmz1t_2650 |
Symbol | |
ID | 7873391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2868100 |
End bp | 2870841 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699573 |
Product | ATP-binding region ATPase domain protein |
Protein accession | YP_002889629 |
Protein GI | 237653315 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
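The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(2868100, 2870841)
print(length)                  # 2742
print(protein_length(length))  # 913
```

Both results match the Gene Length (2742 bp) and Protein Length (913 aa) rows. The strand only affects which end of the interval holds the start codon, not the length.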
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.574534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCCCGG GCTGGCTGAT CGTCGGCGCT TCGTTCGCCT ATCTGCTGCT GCTGTTCGCG GTGGCCTACT TCGGCGACCG GCGCGCGGAT GCCGGGCGCT CGGTCATCGA CAATCCCTGG GTTTTCAGTC TTTCGCTCGC GGTGTATTGC ACCGCGTGGA CCTATTTCGG CAGCGTCGGC CGCGCCGCCT CGGGCGGGCC GTGGTTCCTG CCCACCTATC TCGGGCCCAC GCTGGGCCTG ACCCTGGCCT GGCTGGTCGT GCTCAAGATG ATCCGCATCG CGCGCAGCTA CCGGGTCACC TCGATCGCCG ACTTCGTCGC CTCGCGCTAC GGCAAGAGCC ACCTGCTCGG CGGGCTGGTC ACGCTGATCG CGGTGGTCGG CATCGTGCCC TACATCGCGT TGCAGCTGAA GGCGATCTCC AGCGGCTACG CGCTGCTGGT GGGCGAGCAC GACCCGCTGT TCCGCATCGA GAGCCAGGCG GGCTGGTGGC AGGACGGCAC GCTCTACATC GCGCTCGCGC TCGCCGCCTT CACGGTGCTG TTCGGCACCC GCCACCTCGA CACCACCGAG CGCCACGAGG GCATGGTCGC GGCGATCGCC TTCGAGTCGG TGGTCAAACT TTTCGCCTTC CTGGCGGTCG GCATCTACGT CACCTACTGG CTCTACGACG GCTTTGGCGA CCTCTACGCG CGCGCCGCCG CCGTGCCCGA GCTGCACCGC CTGCTCGCCT TCGACGGTGC CGGCCGGCTC GGCTACGGCG GCTGGTTCGC TCACGTGCTG CTGGCCATGC TGTCCATCAT CTTCCTGCCG CGGCAGTTCC AGATCGCGGT GGTCGAGAAC GTCAACGAGC AGCACCTGCG CCGCGCGACC TGGGTCTTTC CCGCCTACCT GCTGGCGATC AACATCTTCG TGATCCCGAT CGCGATCGGC GGGCTGCTGC ATTTCGGCCG CGGCACCGTG GATCCGGACA CCTTCGTGCT GACCCTGCCG CTCGCGCACG GGCAGAGCGC GCTCGCGCTG CTCGTCTTCA TCGGCGGCCT GTCGGCGGCG ACCGGCATGG TGATCGTGGA GGCGATCGCG CTCTCCACCA TGGTGTGCAA CGACCTGGTG ATGCCGATGC TGCTGCGCAG CCGGCGGCTC GACCTCGCGC GCGAGCGCGA CCTCACCGGT CTGCTGCTGG GCATCCGCCG CGGTGCGATC GTGGTGCTGC TGCTGCTCGG CTACCTCTAC TTCCGCCTCG CCGGCGAGGC CTACGCGCTG GTCAGCATCG GCCTCATCAG CTTCGCCGCG GTGGCGCAGT TCGCCCCGGT GGTGATCGGC GGCATGTACT GGCGCGGCGG CACCCGCGAG GGCGCGCTCG CCGGTCTGCT CGCCGGCTTC GCGGTGTGGG TCTATACGCT GCTGCTGCCC TCGTTCGCCA AGTCGGGCTG GCTCGATCCG GGCTTCCTCG AGCACGGCGT GCTCGGCCTG GAATGGCTCA AGCCCGAGCA GCTCTTCGGC CTCACCGGCC TGGACAACAT CAGCCACGCG CTGTTCTGGA GCCTGCTCGC CAACATCGGC TGCTACGTCG CGGTATCGAT GCTGCGCGTG CCGACCGGGC CGGAGGCGAC CCAGGGCGCG CTCTTTGTGG ACGTGTTCCG GAGCGGCGCG GCGGCGCCGG CGAGCTTCTG GCGCACCGGC GCCGAGGCCG GCGAGCTGGT GCCGCTGGTG GCGCGCTTCC TCGGGCAGCG CCGCACCGAG GAGGCCTTCG GCGCCTACGC GCGCTCGCAC GGCCTGCAGC AGGTCTCCGA GCTCAAGGCC 
GACCCCGAGC TGGTGCACTT CGCCGAGACC CTGCTCGCCG GCGCGATCGG CAGCGCCTCC GCGCGGGTGA TGGTGTCGAC CGTGGTGCAG GCGGAGCCGC TCGGGCTCGG CGAGGTCATG GACATCCTCG ACGAGGCCTC GCAGGTGCGC GCCTACTCGC ACAAGCTCGA GGAGAAGTCG CGTGCGCTCG AGGCCGCGAC CGCCGAGCTG CGCGCCGCCA ACGAGCGCTT GAAGGAGCTC GACCGCCTCA AGGACGACTT CATGTCCTCG GTGACGCACG AGCTGCGCAC GCCCTTGACC TCGATCCGAG CGCTCTCCGA GATGATGCTC GACGACCCCG ACATCGACGT CGAGGACCGC CAGCGCTTCC TCGGCATCAT CGTGTCCGAG GCGGTGCGCC TGTCGCGCCT GGTGAACCAG GTGCTCGACA TGGCCAAGAT CGAGTCCGGC CATGCCGAGT GGCACAACAC CGACATCGAC CTGTGCGAAC TGGTGCGCCA CTCGGGCGAC GCCACCCTGC AGCTCTTCCG CGACCGCGGC GCCGAACTGC ACCTGCACCT GCCGGAGTCG GTGCCGACCC TGCGTGCCGA CCACGACCGA CTGCTGCAGG TGATGCTGAA CCTGCTGTCG AACGCGGCGA AGTTCGTGCC CACGGGCAGC GGTCGCGTGG ATGTGACGTT GAGCTGCGAT GGCGAACGCC TGCGCGTCGC GGTGAAGGAC AACGGGCCGG GGATCGAGCC GGCGCAGCAG CCGGTGGTGT TCGAGAAGTT CCGCCAGGGC GGCGACGAAC GCTCGCGACC GCAGGGTACC GGCCTGGGGC TGCCGATCAG CCGCCAGATC GTCGAGCACT TCGGCGGTCG GCTGTGGCTG GAGTCGGTGC CGGGCGAGGG CGCGACATTC TCCTTCGAGC TGCCGCTGAA GTCGGCGGCG GGCGAGGGAT GA
Protein sequence | MLPGWLIVGA SFAYLLLLFA VAYFGDRRAD AGRSVIDNPW VFSLSLAVYC TAWTYFGSVG RAASGGPWFL PTYLGPTLGL TLAWLVVLKM IRIARSYRVT SIADFVASRY GKSHLLGGLV TLIAVVGIVP YIALQLKAIS SGYALLVGEH DPLFRIESQA GWWQDGTLYI ALALAAFTVL FGTRHLDTTE RHEGMVAAIA FESVVKLFAF LAVGIYVTYW LYDGFGDLYA RAAAVPELHR LLAFDGAGRL GYGGWFAHVL LAMLSIIFLP RQFQIAVVEN VNEQHLRRAT WVFPAYLLAI NIFVIPIAIG GLLHFGRGTV DPDTFVLTLP LAHGQSALAL LVFIGGLSAA TGMVIVEAIA LSTMVCNDLV MPMLLRSRRL DLARERDLTG LLLGIRRGAI VVLLLLGYLY FRLAGEAYAL VSIGLISFAA VAQFAPVVIG GMYWRGGTRE GALAGLLAGF AVWVYTLLLP SFAKSGWLDP GFLEHGVLGL EWLKPEQLFG LTGLDNISHA LFWSLLANIG CYVAVSMLRV PTGPEATQGA LFVDVFRSGA AAPASFWRTG AEAGELVPLV ARFLGQRRTE EAFGAYARSH GLQQVSELKA DPELVHFAET LLAGAIGSAS ARVMVSTVVQ AEPLGLGEVM DILDEASQVR AYSHKLEEKS RALEAATAEL RAANERLKEL DRLKDDFMSS VTHELRTPLT SIRALSEMML DDPDIDVEDR QRFLGIIVSE AVRLSRLVNQ VLDMAKIESG HAEWHNTDID LCELVRHSGD ATLQLFRDRG AELHLHLPES VPTLRADHDR LLQVMLNLLS NAAKFVPTGS GRVDVTLSCD GERLRVAVKD NGPGIEPAQQ PVVFEKFRQG GDERSRPQGT GLGLPISRQI VEHFGGRLWL ESVPGEGATF SFELPLKSAA GEG
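The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to reproduce the 70% GC content reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGCTCCCGG GCTGGCTGAT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.65
```

Running the same two functions over the complete gene sequence would give a value to compare against the GC content row.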