Gene Tmz1t_3270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3270 
Symbol 
ID7874491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3581110 
End bp3582720 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content69% 
IMG OID643700204 
Productexosortase 1 
Protein accessionYP_002890242 
Protein GI237653928 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.648481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATGA AAGGAACCGC GATGCTTCAA TCCGTGCGGC CGGGCGCGAT GCCGACCCAC 
CTGCAGGCGA CCGACGCCCG CCCTGCGACG CTCGGTGATC CGCGCACGCG CAGGCTTCAC
CTGGCGGTGC TGGCGCTGGG ACTGCTGTGG CTGCTGTTCT GGTATCGCGA CACCTTCCTG
GCCATGGTGG GCATCTGGGA CCGCTCGGAC ACCTTCGCCC ACGGCTTCGT GATCGCGCCG
ATCTCGGCCT GGCTGGTATG GCGGCGCCGG CACTTCATCG ACCATGTGCC GATCGCGCCC
TCGCTGCTCG GCATCTGCGC CGGACTCGTC GCCGGCGCGG CCTGGTTGCT CGGCGAGCTC
GCCAGCGTCG ATGCGGTGTC GCAGTTCGCC TTCGTGGGCA TGCTGGTGGG CTTCGTCTGG
GCGGTGATGG GCAGCGCGGT GGTGCGCACC TACGCCTTCC CGATCGGCTT CCTGTTCTTC
ATGGTGCCGT TCGGCGAGTT CCTCTTCCCC ACCATGATGC AGTGGACCGC CGACTTCATC
ATCTGGGCGG TGCGCGCCTC CGGAGTGCCG GTGTACGCCG AAGGCTTCCT GCTGGTGATC
CCCTCCGGGC GCTGGCAGGT GGTGGAGGGG TGCAGCGGCG TGCGCTACCT GATGGCCTCG
ATCGTCGTCG GCAGCCTGTA CGCTTACCTG AACTACCGCA GCACGCAGAA GCGCCTGCTT
TTCGTCGCCG CCTCGATCGT GCTGCCGGTA TTCGCCAACT GGCTGCGCGC CTACGGCATC
GTGATGCTCG GCCACCTCAC CGACAACCGC CTCGCTGCCG GGGCCGACCA CCTGATCTAC
GGCTGGGTCT TCTTCGGCAT CATCATCCTC GCGCTGTTCT GGATCGGTTC GCGCTGGCAG
GAGGACGAGG AAGAGGCCCC GGCCATGCCC GTCGGCTCCG CGCGATCGTC GGGCGTACGC
ACCGGTGGCC TGGGCTGGCT GGCCGTGGCG GTGGTCGCGG TGGGGCTGTG GGTGCCGGTC
CTCGGCCACC TCGAGCGCCA GGGCGAGCAG GGGCCGGTGC GCTTCGCCGC GCTGGAGGGG
CAGGGCGCGT GGCAGCGGAC GGAAGCGGGC GAATTGCCGC CGTGGTCGCC GAGCTATTCC
GGCATGCGCG ACACCTTCCG TTCGACCTGG CGCGCGGGGA GCGTTCCGGT GGGGGTGTAC
ATCGGCTATT ACCGCGACCA GGGACCGGGC AGGGAGTTGA TCAATTCCGA GAACCGCGTG
CTGATCAGCA AGGACCCGGT GTGGCGCATG ACCGCCGCGG GTCCGGTCCG GGTGCAACTC
GGCGACCAGG CGCAGGCGTG GCGCACGCTG GAGATGGCGA GCGAGCATGC ACGCATGGTC
GTCTGGTACG CGTACTGGAT CGGCGGCCGG TGGACCACCA GCGATCATCT TGCCAAGGTC
TATCTGGCGC TGAGCCGCCT GCGCGGCGAA GGGGACGATT CGGCGGTGGT CATGCTGCAT
GCGCCGCAGG GTCAGGGCGG GCACGCCGGC ACGATCGCCG CGCTTGAAGA CTTCGCACGC
GAGATGGGCG GGCCCCTGCA GGCCATGCTG GATCGCACCG CGAACCCTTG A
 
Protein sequence
MNMKGTAMLQ SVRPGAMPTH LQATDARPAT LGDPRTRRLH LAVLALGLLW LLFWYRDTFL 
AMVGIWDRSD TFAHGFVIAP ISAWLVWRRR HFIDHVPIAP SLLGICAGLV AGAAWLLGEL
ASVDAVSQFA FVGMLVGFVW AVMGSAVVRT YAFPIGFLFF MVPFGEFLFP TMMQWTADFI
IWAVRASGVP VYAEGFLLVI PSGRWQVVEG CSGVRYLMAS IVVGSLYAYL NYRSTQKRLL
FVAASIVLPV FANWLRAYGI VMLGHLTDNR LAAGADHLIY GWVFFGIIIL ALFWIGSRWQ
EDEEEAPAMP VGSARSSGVR TGGLGWLAVA VVAVGLWVPV LGHLERQGEQ GPVRFAALEG
QGAWQRTEAG ELPPWSPSYS GMRDTFRSTW RAGSVPVGVY IGYYRDQGPG RELINSENRV
LISKDPVWRM TAAGPVRVQL GDQAQAWRTL EMASEHARMV VWYAYWIGGR WTTSDHLAKV
YLALSRLRGE GDDSAVVMLH APQGQGGHAG TIAALEDFAR EMGGPLQAML DRTANP