Gene Tmz1t_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1871 
Symbol 
ID7084294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2111743 
End bp2114172 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content74% 
IMG OID643698894 
Producthypothetical protein 
Protein accessionYP_002355519 
Protein GI217970285 
COG category 
COG ID 
TIGRFAM ID[TIGR02242] phage tail protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0347166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGCA ACGGCACCCG CTTCCTGCTC CTCGACGGCG CGGCCGACTT CCACAACACA 
AGCCGGCAAT GCAGCTGGGA CCACGAGCAG CGCGCCTTCA CGCTGACCCG CCAGGACGCC
CCGCGCCTGC CACGCCTGGC GGCCGCGCGC GCACGCGAGC GCCTGCTCGC GGCGACGCCC
TGGATGCTCG ACGAGCACGG CCAGCTCGGT CGCCTGTCGG ACGACGGCCT GCGCCTGGAG
TGCGCGCCGT CCTGGCCGCC ACGCACCTGG CAGCCGGTGC GCGCCACGCT CGACGAAACG
CACGCCGATG CCTCCGCCCT CGAGGTGCTG GTGCTCGATC CGGTCGATGC GCCGGCGGGC
CGCTTCACCG ACCTCGCCTT CGGCGGCAGC GGCCTGGTCG TGCTGCCGTG GAGCGACGGT
GGCGTACAGC ATGGTCTCAC CGCGGTGCAT CTGCGCCGGC GCTGGCAGGC CCGCTGCGCG
CTGCCCTTCG CGCCGCGCCG CGCCTGGGTC GAGGCCGCCA GCGGCACCGA CCGCGTGTGG
CTGCTCGGCG AGACGCAGCT GGGCCTCGCC GTGGGGGCGC CCCTGCCGCA GCCCTACCGC
GGCCGCCCCG AGCGCTTCGA GCCGCTGCAG ACCAATCCCG ATCCGCTGCG CCTGGCGTGG
ACGCTGGCCC TGCCGCCCCA CGGCGGGCTG CTCGGGCTGT GCACCGACGA AGACCATCTC
TTCGTGCTCG GCGAAACCCC CGACAGCACG GCGGACGCCC CGCGCATGCA GATCTTCATG
CGTGCGCTCG GCGCCGCACC GGGCGAGGGC TTCGACATCC GGCGCCTGCC GGACGGGCTG
CCGCTCGCGA CCGACCTCGC CGCGGCCGGC GAGGGCCGGC TCCTGCTCCT GCCGCCCATC
GACGAAGGCG CGGAACCTGG AACGCGGCGC GACTGCCCGC TCATCTCCTT GCGCGAGGAC
GCGCCCACCG CAGAGTTGGT GCCCGAGCGC TGGCCGCGCC GTCCCGCCGC AGCGCTGCCC
GCCGCGGACC GCTTCGTGCG ACACCGCGAC GGCCGGCCAC GCGCCCTGAG CGCAGACGGC
CCGGTCCGGC TCTACCGCCT CGCGCAGGCC CGCTTCGCGC CCAACGGCAC GGTCACGCTG
AGCACGCCGC TCGACTCGGC GATGCCCGAC ACGCTGTGGG ACCGCATCTT CATCGACGCC
TGCATCCCCC CCGGCTGCCG GATCGATTTC GCCGTGCAGG CAGGCGACGA CCGCGAGAAC
CTGCCCGCGG AGTGGATCGC GCAGCCGCAG CCGGTGCTGA CCGCGGTGTC GTCGGAGCTA
CCCTTCGCGT CCGGGCGCGC ACCCGGGAGT GGGGATCACG CAGATCGTTC CGGCCTGTTC
GAGCTGCTGA TCCAGCGCGC CAGCGGCGCG GTGCGCGAGG TGCGCGGGCG CTACCTGCGC
CTGCGCATCA CGATGCACGG CGACGGCCGC CACAGCCCGG CGATCTTCGC GCTGCGCGTG
CAGTACCCGC GCTTCTCCTG GCAGACCCAC TACCTCCCCG AGCATTTCCA GCAGCAGGAG
CGTCCGCTCG CGAGCGCGGA GGCCAACCAG GCAGAGGCGA ACGGCGCCGA CTTCCGCGAG
CGTCTGCTGG CGAGTTTCGA GGGTCTGCTG ACCCCGATCG AGGACCGCAT CGCCGCAGCC
GAGATCCTGC TCGACCCCGC GGTCGCGCCC GTGGCGCACC TGCCCGGCCT CGCCGCGATG
CTCGGCACCA CGCTGCCGCC CCACTGGCCG GAAGCACGCC GCCGGCGCTG GCTGGGCGCG
CAGGGCATGC TTCAGCAGAG CCACGGCAGC TACCGCGGCC TGCTGCTCGC GCTCGACATC
CTCACCGATG GAGCGGTGGC GCGGGGCGCG GTGATCCCGG TCGAGCACTT CCGCCTGCGC
CGCACGATGG CGACGATCCT CGGTGTGGAC ATGGATGACC GCGACCACCC GCTGACCCTG
GGCACCGGTC TTTCCGGCAA CAGCCTCGTC GGTGACAGCC TGATCCTGTC CGACGACCTC
GCCCGCGAGT TCCTCGCCCT CTTCGCCCCG GAGGTTGCCG AAGCGAAGGG CGAGGCCGCG
GTGGTCGAAC GCTTCTTCGA AGAAGCCGCG CGGCGCATGA CGGTGATCCT GCACGGACCG
GCGCGACGGC TCGCAGCCGT CGTGCGCGAC GCCCTGCCCG CGCTCGTGCC CGCGACCGTG
CAATGGGCGA TCCGCAGCAG CGAGCACCCC TTCGTGCCGG GTCTGTCGCC GCTGCTGGGC
ATCGACACCT GGCTGGAAGC CTCGCCACCA GCGCGGCCCG TGGTGCTCGA CCGCACACGG
CTGGGTCGCG GTGACCTGCT GCACAACCCG GTCGCCCTCG ACCCCGAGCA CGCCGTGCCG
ATCGACGCGA CCGTCCTGGA CGCACCGTGA
 
Protein sequence
MNSNGTRFLL LDGAADFHNT SRQCSWDHEQ RAFTLTRQDA PRLPRLAAAR ARERLLAATP 
WMLDEHGQLG RLSDDGLRLE CAPSWPPRTW QPVRATLDET HADASALEVL VLDPVDAPAG
RFTDLAFGGS GLVVLPWSDG GVQHGLTAVH LRRRWQARCA LPFAPRRAWV EAASGTDRVW
LLGETQLGLA VGAPLPQPYR GRPERFEPLQ TNPDPLRLAW TLALPPHGGL LGLCTDEDHL
FVLGETPDST ADAPRMQIFM RALGAAPGEG FDIRRLPDGL PLATDLAAAG EGRLLLLPPI
DEGAEPGTRR DCPLISLRED APTAELVPER WPRRPAAALP AADRFVRHRD GRPRALSADG
PVRLYRLAQA RFAPNGTVTL STPLDSAMPD TLWDRIFIDA CIPPGCRIDF AVQAGDDREN
LPAEWIAQPQ PVLTAVSSEL PFASGRAPGS GDHADRSGLF ELLIQRASGA VREVRGRYLR
LRITMHGDGR HSPAIFALRV QYPRFSWQTH YLPEHFQQQE RPLASAEANQ AEANGADFRE
RLLASFEGLL TPIEDRIAAA EILLDPAVAP VAHLPGLAAM LGTTLPPHWP EARRRRWLGA
QGMLQQSHGS YRGLLLALDI LTDGAVARGA VIPVEHFRLR RTMATILGVD MDDRDHPLTL
GTGLSGNSLV GDSLILSDDL AREFLALFAP EVAEAKGEAA VVERFFEEAA RRMTVILHGP
ARRLAAVVRD ALPALVPATV QWAIRSSEHP FVPGLSPLLG IDTWLEASPP ARPVVLDRTR
LGRGDLLHNP VALDPEHAVP IDATVLDAP