Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3897 |
Symbol | |
ID | 7873545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4292407 |
End bp | 4294353 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700836 |
Product | hypothetical protein |
Protein accession | YP_002890859 |
Protein GI | 237654545 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACTT CCACAACGAT AACGCGCTCA GCATGGCTTT TCCTCGGACT GCTCGCCATC GCCTGCGGAT TGCTGTACAT CCCCGGCCTG ACAGGCGCCC TCTACTACGA CGACATCCGC CCTCTCTCCG GCCTCGCCAG CGTCGTCAAC CTCGACTCCG CGCTCTACTA CCTCTCCTCC GAGATCTCCG GTCCACTCGG CCGTCCGGTC GCCATGCTCA GCTTCCTGGT GCACGTCGAG GACTGGCCCG GAGCCGTCGA GAACATCTTC CTTTTCAACG TACTGCTGCA CCTGACCAAC GGCACGCTGG TCGCCCTGCT CGTCCATCGG CTGCTCAGTC TGAGGGGGGT CGCAGGCAGT GCGCCCGCCT GGATTGCAGC CAGCACAGCT GCAATCTGGA TGCTGATGCC GCTGCAGGTC TCCTCCTCGC TGATCGCGGT GCAGCGCATG GCGACGCTGT CCGCCTTCTT CGTGCTCGCG GGCCTGTTGA TCCATGTTCA GGGCATCGCG ATCGAGGACC GGCGTCGTGC ACTCGGCGCC GCTCTGCAGG CGCTCGGCCT CGTGGGGTTC ACCCTGCTCG CGATGTTCAC CAAGGAAAAC GGCATCCTGC TGCCCGTTTT CGCGCTCGTG ATCCAGGCGA CGCTGCTCGC GGACCACACT TCCCCGGGCC GCCTTCGACT CCTGCGCACC GTCGCCGGCG GCGGGGCGCT CGCGATCATC CTGGCCTACC TCGCATACAG CGCATTCCGT TCCGGTGGCG TGTTCGGGGG GCGTGAGTTC GATCTGCTCG AACGCATCCA GACCCAGCCG CTGATCCTGC TCGAATACCT CCGCGAAGCC TTCGCACCCC GTCCCTATGG CTTGCACCCC TTCCATGACG GCTACCCCAA GGTCAGCGCG CTCGCAGAGC ACCCCGTGGC GCTCTTCGCC GCCATCCTGT GGCCGACGCT CGCGGTGCTC GCGGTCCGCT TCCGCCGCCG CTATCCGGTC GCGGCCTTCG CCGTGCTGTG GTTTCTTGCT GCCCACCTGC TCGAATCCAC CGTCCTCGGG CTTGAGCTGT ACTTCGAGCA TCGCAACTAC CTCGCGCTGT TCGGACCCTG CCTTGCAATC GCCTGGGCGG TGGGGCGGAC ACCAGTTCCG TACCGCCGAC TCGCCGTCGC GGGCTTCGCG GCCTATCTCG CAACGCTCGG GGCGATCCTG TTCCACGTGA CCAGCCTCTG GGGCGACAAG CTCGACGCAG CGGAGACCTG GTTCGTTCAC GCCTACAAGT CGCCGCGAGC TGCGGAGCAC CTCGCACTCC TGTACCTCGA GCAGGGCCGC TTCAACGAGG CCTACCAGGT CATACGGATC CAGGTGGATG ATTGCCCTCA GTGCCTCGCC TCGGTGACCC AGGCCGCGCT GCTCGCCTGC GCAGCGGGAG AGGCTCAGCG CACCCGAGAC TACTTCGCTC AAGCGGAAGC GCTTGCCATC GAGGCCCGCA ACGTCAGCGG AGCGGCGACG ACGCTGACTG CAATGCACAA CGCAATCGAA GACGGAAAAT GCTCCCTGGT CGACTACGAT CAACTCGAAA CACTCAACCG CAGCCTCCTG CGCCACCAAA CGGGGGGGCT CGGGACACTC AGCCGCAAGG CGATTCACAT GAACCTGGAA CGGATCGCAC TGGCCAAGGG TGACGCGAAT TCCGCACTGG ACCACCTGAA ACAGGCCTGG GCGGTAGACC GGGACCGCGC GCTCGGACAT GCGATCATTG ACGCACTACT CGAACGGCAT GAAATCGAGA ACGCGGAGGT ATTCCACCGC AAAGTACTTT GCCGCGAGTT TCCCAAGCAC CCGGTGCTCG CCAACGTCGC ACGCAAGCAA TGCGATGAAT CGATGCAGGC CATACTCGAG GCGGCAAGCA GTCATCCGCA ACGCACGGGT GACGCAAAAA CCGCCGCAAC ACCATGA
|
Protein sequence | MSTSTTITRS AWLFLGLLAI ACGLLYIPGL TGALYYDDIR PLSGLASVVN LDSALYYLSS EISGPLGRPV AMLSFLVHVE DWPGAVENIF LFNVLLHLTN GTLVALLVHR LLSLRGVAGS APAWIAASTA AIWMLMPLQV SSSLIAVQRM ATLSAFFVLA GLLIHVQGIA IEDRRRALGA ALQALGLVGF TLLAMFTKEN GILLPVFALV IQATLLADHT SPGRLRLLRT VAGGGALAII LAYLAYSAFR SGGVFGGREF DLLERIQTQP LILLEYLREA FAPRPYGLHP FHDGYPKVSA LAEHPVALFA AILWPTLAVL AVRFRRRYPV AAFAVLWFLA AHLLESTVLG LELYFEHRNY LALFGPCLAI AWAVGRTPVP YRRLAVAGFA AYLATLGAIL FHVTSLWGDK LDAAETWFVH AYKSPRAAEH LALLYLEQGR FNEAYQVIRI QVDDCPQCLA SVTQAALLAC AAGEAQRTRD YFAQAEALAI EARNVSGAAT TLTAMHNAIE DGKCSLVDYD QLETLNRSLL RHQTGGLGTL SRKAIHMNLE RIALAKGDAN SALDHLKQAW AVDRDRALGH AIIDALLERH EIENAEVFHR KVLCREFPKH PVLANVARKQ CDESMQAILE AASSHPQRTG DAKTAATP
|
| |