Gene Information |
Locus tag | Tmz1t_1871 |
Symbol | |
ID | 7084294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2111743 |
End bp | 2114172 |
Gene Length | 2430 bp |
Protein Length | 809 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698894 |
Product | hypothetical protein |
Protein accession | YP_002355519 |
Protein GI | 217970285 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02242] phage tail protein domain |
|
|
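The gene and protein lengths in the table above can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which reproduces the 2430 bp listed in the record) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record for Tmz1t_1871.
length = gene_length(2111743, 2114172)
print(length)                  # 2430
print(protein_length(length))  # 809
```

Both results agree with the Gene Length and Protein Length rows. The strand (-) does not affect this arithmetic, since it only determines the direction in which the interval is read.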
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0347166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA ACGGCACCCG CTTCCTGCTC CTCGACGGCG CGGCCGACTT CCACAACACA AGCCGGCAAT GCAGCTGGGA CCACGAGCAG CGCGCCTTCA CGCTGACCCG CCAGGACGCC CCGCGCCTGC CACGCCTGGC GGCCGCGCGC GCACGCGAGC GCCTGCTCGC GGCGACGCCC TGGATGCTCG ACGAGCACGG CCAGCTCGGT CGCCTGTCGG ACGACGGCCT GCGCCTGGAG TGCGCGCCGT CCTGGCCGCC ACGCACCTGG CAGCCGGTGC GCGCCACGCT CGACGAAACG CACGCCGATG CCTCCGCCCT CGAGGTGCTG GTGCTCGATC CGGTCGATGC GCCGGCGGGC CGCTTCACCG ACCTCGCCTT CGGCGGCAGC GGCCTGGTCG TGCTGCCGTG GAGCGACGGT GGCGTACAGC ATGGTCTCAC CGCGGTGCAT CTGCGCCGGC GCTGGCAGGC CCGCTGCGCG CTGCCCTTCG CGCCGCGCCG CGCCTGGGTC GAGGCCGCCA GCGGCACCGA CCGCGTGTGG CTGCTCGGCG AGACGCAGCT GGGCCTCGCC GTGGGGGCGC CCCTGCCGCA GCCCTACCGC GGCCGCCCCG AGCGCTTCGA GCCGCTGCAG ACCAATCCCG ATCCGCTGCG CCTGGCGTGG ACGCTGGCCC TGCCGCCCCA CGGCGGGCTG CTCGGGCTGT GCACCGACGA AGACCATCTC TTCGTGCTCG GCGAAACCCC CGACAGCACG GCGGACGCCC CGCGCATGCA GATCTTCATG CGTGCGCTCG GCGCCGCACC GGGCGAGGGC TTCGACATCC GGCGCCTGCC GGACGGGCTG CCGCTCGCGA CCGACCTCGC CGCGGCCGGC GAGGGCCGGC TCCTGCTCCT GCCGCCCATC GACGAAGGCG CGGAACCTGG AACGCGGCGC GACTGCCCGC TCATCTCCTT GCGCGAGGAC GCGCCCACCG CAGAGTTGGT GCCCGAGCGC TGGCCGCGCC GTCCCGCCGC AGCGCTGCCC GCCGCGGACC GCTTCGTGCG ACACCGCGAC GGCCGGCCAC GCGCCCTGAG CGCAGACGGC CCGGTCCGGC TCTACCGCCT CGCGCAGGCC CGCTTCGCGC CCAACGGCAC GGTCACGCTG AGCACGCCGC TCGACTCGGC GATGCCCGAC ACGCTGTGGG ACCGCATCTT CATCGACGCC TGCATCCCCC CCGGCTGCCG GATCGATTTC GCCGTGCAGG CAGGCGACGA CCGCGAGAAC CTGCCCGCGG AGTGGATCGC GCAGCCGCAG CCGGTGCTGA CCGCGGTGTC GTCGGAGCTA CCCTTCGCGT CCGGGCGCGC ACCCGGGAGT GGGGATCACG CAGATCGTTC CGGCCTGTTC GAGCTGCTGA TCCAGCGCGC CAGCGGCGCG GTGCGCGAGG TGCGCGGGCG CTACCTGCGC CTGCGCATCA CGATGCACGG CGACGGCCGC CACAGCCCGG CGATCTTCGC GCTGCGCGTG CAGTACCCGC GCTTCTCCTG GCAGACCCAC TACCTCCCCG AGCATTTCCA GCAGCAGGAG CGTCCGCTCG CGAGCGCGGA GGCCAACCAG GCAGAGGCGA ACGGCGCCGA CTTCCGCGAG CGTCTGCTGG CGAGTTTCGA GGGTCTGCTG ACCCCGATCG AGGACCGCAT CGCCGCAGCC GAGATCCTGC TCGACCCCGC GGTCGCGCCC GTGGCGCACC TGCCCGGCCT CGCCGCGATG CTCGGCACCA CGCTGCCGCC CCACTGGCCG GAAGCACGCC GCCGGCGCTG GCTGGGCGCG 
CAGGGCATGC TTCAGCAGAG CCACGGCAGC TACCGCGGCC TGCTGCTCGC GCTCGACATC CTCACCGATG GAGCGGTGGC GCGGGGCGCG GTGATCCCGG TCGAGCACTT CCGCCTGCGC CGCACGATGG CGACGATCCT CGGTGTGGAC ATGGATGACC GCGACCACCC GCTGACCCTG GGCACCGGTC TTTCCGGCAA CAGCCTCGTC GGTGACAGCC TGATCCTGTC CGACGACCTC GCCCGCGAGT TCCTCGCCCT CTTCGCCCCG GAGGTTGCCG AAGCGAAGGG CGAGGCCGCG GTGGTCGAAC GCTTCTTCGA AGAAGCCGCG CGGCGCATGA CGGTGATCCT GCACGGACCG GCGCGACGGC TCGCAGCCGT CGTGCGCGAC GCCCTGCCCG CGCTCGTGCC CGCGACCGTG CAATGGGCGA TCCGCAGCAG CGAGCACCCC TTCGTGCCGG GTCTGTCGCC GCTGCTGGGC ATCGACACCT GGCTGGAAGC CTCGCCACCA GCGCGGCCCG TGGTGCTCGA CCGCACACGG CTGGGTCGCG GTGACCTGCT GCACAACCCG GTCGCCCTCG ACCCCGAGCA CGCCGTGCCG ATCGACGCGA CCGTCCTGGA CGCACCGTGA
|
Protein sequence | MNSNGTRFLL LDGAADFHNT SRQCSWDHEQ RAFTLTRQDA PRLPRLAAAR ARERLLAATP WMLDEHGQLG RLSDDGLRLE CAPSWPPRTW QPVRATLDET HADASALEVL VLDPVDAPAG RFTDLAFGGS GLVVLPWSDG GVQHGLTAVH LRRRWQARCA LPFAPRRAWV EAASGTDRVW LLGETQLGLA VGAPLPQPYR GRPERFEPLQ TNPDPLRLAW TLALPPHGGL LGLCTDEDHL FVLGETPDST ADAPRMQIFM RALGAAPGEG FDIRRLPDGL PLATDLAAAG EGRLLLLPPI DEGAEPGTRR DCPLISLRED APTAELVPER WPRRPAAALP AADRFVRHRD GRPRALSADG PVRLYRLAQA RFAPNGTVTL STPLDSAMPD TLWDRIFIDA CIPPGCRIDF AVQAGDDREN LPAEWIAQPQ PVLTAVSSEL PFASGRAPGS GDHADRSGLF ELLIQRASGA VREVRGRYLR LRITMHGDGR HSPAIFALRV QYPRFSWQTH YLPEHFQQQE RPLASAEANQ AEANGADFRE RLLASFEGLL TPIEDRIAAA EILLDPAVAP VAHLPGLAAM LGTTLPPHWP EARRRRWLGA QGMLQQSHGS YRGLLLALDI LTDGAVARGA VIPVEHFRLR RTMATILGVD MDDRDHPLTL GTGLSGNSLV GDSLILSDDL AREFLALFAP EVAEAKGEAA VVERFFEEAA RRMTVILHGP ARRLAAVVRD ALPALVPATV QWAIRSSEHP FVPGLSPLLG IDTWLEASPP ARPVVLDRTR LGRGDLLHNP VALDPEHAVP IDATVLDAP
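The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, assuming the sequence text has been copied out of the record as a plain string, the following helpers strip the grouping whitespace and compute the GC fraction, which could then be compared with the GC content row. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in the record.
example = "ATGAACAGCA ACGGCACCCG"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Run on the full gene sequence, `len(clean_sequence(...))` should match the Gene Length row if the sequence was copied completely.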