Gene Tmz1t_2705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2705 
Symbol 
ID7873447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2932222 
End bp2933967 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content75% 
IMG OID643699628 
Productvon Willebrand factor type A (vWA) domain-containing protein 
Protein accessionYP_002889684 
Protein GI237653370 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACCG GCGATGCAGG TTCCGGCGTG GCGCCCGCCG TCGCGCAGCA ATCGGCCTCC 
GCGGCGGCCG ACAGCGCGCC CGCGCCGCTG CCGCCGGACC GCGCCCGGGC CGCCGAGGCT
CCACCCGTCC CCCGGCTCTT CGAGGCGCTC GAAGCCGTGC TCGCGCCGCT CGATCGCCTG
CCGCGTCCGC TGTGGCTCGG TGGCATGACC CACTCCGGCG GCGAACTCGC GGGCCGTCTC
GCCGCGCTCG AAGCCTGGCG GGCTGCGCTG CTCGCCGGCG GCCTGCCCGG GTCTTGCGCG
GCCTGGCCCG AGGCGGAGGT GCAGGAGGCC GTGCGCGAGG TCTTCCGGGG CCTGGGCTTG
CCCGCCTACT GTGCAGACCA GCCCGCGCTG GTCGATACCG TGCTGCAAGG CCTGCTCTTC
CACCTCGACC TGATCGTCGA CTACCGCGAC CGCGGCGATA CCGAGGCCGC GGCGCAGGCG
AGGGCGCTCG ATTCCTTCGC TGCCGACTGG GCCGAGCGCT GCGGCGAGAT CGACGAGCTG
GTCGGCGCCT TCGGGGATCT CGGCGATCTG CTCGACAACG CCCGCTGGGA TGCGCTGCGC
GGCCTGCTGC GCAGCACGGA CTGGCGCGAG GTGCTGCGCA TCCGCGCGCT GATCGAGGGC
CTGCCCGAGC TCGCCCGCAT CCTGCGCGCA CTCGGCCGCG CCTGCCCCAC CGACGAGGAC
GCCGAATCCA GCCGGGCATT GCACGCCGTG GTCGAGCACA CCGAGATACA GCGCAGCGTC
TCGCACCGGG TGCGGGTGCC CGACCTGCCC GGCGAAACGC GCGGCGTGCA GCGCTCGGGC
CGGATCGCGC GCATGCTGCC GGCCGAGGCG ACGCTGCTCG GCCATCCGCG CCTGCGCTTG
GTCTGGCATG CGCGCCGCGC CGAGCGCACG CTGCTCGCCT ACGAGGACGA CGACCACCTG
CAGGAGGACT GCCTGCGCCC GGCGCCGGTG CTGCGCCCCA GTCAGCGCCC TGCGCCGGCA
CGGCGCCTGG AGCAGGGGCC GATGCTGGTG TGCGTGGATA CCTCGGGCTC GATGCAGGGG
GGCGCCGAGG CGGTGGCCAA GGCGGTCGTG CTGGAGGCGG TGCGCTGCGC TCACGCCCGG
CGCCGCGCCT GCCGGGTGTA TGCCTTCGGC GGGCCCGACG AGGTGGTCGA GATGGAGCTC
GGCGTCGATG TCGATGGCGT CGGCCGGCTC GCCCGCTTCC TCGGCCAGGG CTTCGGCGGC
GGCACCGACA TCTGCGCCCC GCTCGAGCGT GCGCTCGCCC GCCTCGACGA AGCCGGCTGG
CAGCTCGCGG ACCTGCTGAT CGCGTCCGAT GGCGAATTCG GCGCCACCCC GGCGCTCGCC
GCCCGCGTCG AGGCCGCCCG CCGCGAGCGC GGCCTGCGCG TGCAGGGCAT CCTGATCGGC
GACCGCGAGA CCATCGGCCT GCTCGAACTC GCCGACGACA TCCACTGGGT GCGCGACTGG
CGGCGCTATG GAGGCGGTAC GGACAAGCCG GGTGGGGCCG ACGCGGACAA GCGGCGTGGC
GATGGCGCCG GCGCGAACGC CCCGACCCTT GCTGCCGGCG GTGGCTCGCC GGTGCATTCC
AGCCACCTCA CCGCGGACTA TTTCCCCGGT GCCCTGCGCA CCCCCGAGAA CCGTGCCGCC
ACCGTCACAC CCGAGGCGGC TGCCGCTGCC ATCCGTGCCG GCCGCCACCG CGGCGACCGG
TTGTAG
 
Protein sequence
MATGDAGSGV APAVAQQSAS AAADSAPAPL PPDRARAAEA PPVPRLFEAL EAVLAPLDRL 
PRPLWLGGMT HSGGELAGRL AALEAWRAAL LAGGLPGSCA AWPEAEVQEA VREVFRGLGL
PAYCADQPAL VDTVLQGLLF HLDLIVDYRD RGDTEAAAQA RALDSFAADW AERCGEIDEL
VGAFGDLGDL LDNARWDALR GLLRSTDWRE VLRIRALIEG LPELARILRA LGRACPTDED
AESSRALHAV VEHTEIQRSV SHRVRVPDLP GETRGVQRSG RIARMLPAEA TLLGHPRLRL
VWHARRAERT LLAYEDDDHL QEDCLRPAPV LRPSQRPAPA RRLEQGPMLV CVDTSGSMQG
GAEAVAKAVV LEAVRCAHAR RRACRVYAFG GPDEVVEMEL GVDVDGVGRL ARFLGQGFGG
GTDICAPLER ALARLDEAGW QLADLLIASD GEFGATPALA ARVEAARRER GLRVQGILIG
DRETIGLLEL ADDIHWVRDW RRYGGGTDKP GGADADKRRG DGAGANAPTL AAGGGSPVHS
SHLTADYFPG ALRTPENRAA TVTPEAAAAA IRAGRHRGDR L