Gene Tmz1t_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1451 
Symbol 
ID7083534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1617386 
End bp1618729 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID643698469 
Productpeptidase M24 
Protein accessionYP_002355106 
Protein GI217969872 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.582435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAC CCGGCACCGC CCCCATGGAC ACCACCCCCT TCCGCGCCCG CCGCGCCCGC 
CTGCTGCAGC GCATGCAGGC CGCCGGCGGC GGCGTCGCCA TCCTGCCCAC CGCCCCCGAG
CGCGTGCGCA ACCGCGACGC CCACTACGCC TACCGCCACG ACAGCTACTT CTACTACCTC
AGTGCCTTCC GCGAACCCGA GGCCGTGGTC GTGCTGGTGG CGGGCAAGGA GACGAAGCAG
ATCCTCTTCT GCCGCGAGAA GAACGAGGAA CGCGAGATCT GGGACGGCTA CCGCTGGGGC
CCCGAGGCCG CGCGCGCGGC CTTCGGCTTC GACGAGGCGT GGACCATCGG CGACCTCGAA
AAACGCCTCC CCGACTACCT CGCCGACCAG CCCCTGCTGT GGACCAGCCT CGGCTACGAC
AACGACTGGG ACGCGCGCGT ACTCGGCGCG CTCAACGCCG TGCGTGACAA GGCCCGCACC
GGCCTCACCC CGCCGCACTC GGTGCGCGAC CTGCGCGCCG AGCTCGACGA GATGCGCCTG
GTCAAGGACG CGTCCGAGCT CGCCACCATG CGCCAGGCAG CACAGATCTC CGCCGCCGCC
CACTGCCGCG CGATGCGCGC CACCCGCCCG GGCCGGCACG AGTACGAGAT CGAGGCCGAG
CTGCTGCACG CCTTCCGCGC CGCCGGCAGC CAGGCCCCCG CCTACACCAG CATCGTCGCC
GGCGGCGCCA ATGCCTGCGT GCTGCACTAC GTCGACAACG ACCAGCGCCT CAATGACGGC
GACCTGTTGC TGATCGACGC CGGCTGCGAG CTCGACGGCT ACGCCTCCGA CATCACCCGG
ACCTTCCCGG TGAGCGGCCG CTTCTCAGGT CCGCAGCGCG CGGTCTATGA GCTCGTGCTC
GCCGCCCAGG CCGCGGCGCG CGAGGCCACC CGCCCCGGCG CGCACTGGAA CCAGCCGCAC
GACGCCGCGG TGAAGGTGCT CGCCCAGGGC ATGCTCGACC TCGGTCTCCT CCAGGGCAGC
CTGGACGGCG TGCTCGAGAA CGGCGACTAT CGCCGCTTCT ACATGCACCG CACCGGCCAC
TGGCTGGGCA TGGACGTGCA CGACGCCGGC GAATACAAGC TCGGCGGCGA ATGGCGGCCG
CTGGTCGAGG GCATGGTGCT GACCATCGAG CCGGGCTGCT ACATCCGCGC GGCCGAGGAC
GTGCCCGAGG CCTTCTGGAA CATCGGCATC CGCATCGAGG ACGACGCGAT CGTCACCGCC
GACGGCTGCG CGCTGATCAC CGAGGACGCG CCCAAGGCGG TTGCGGACAT CGAGGCCCTG
ATGCGGGACG CCCGTCATGG CTGA
 
Protein sequence
MNAPGTAPMD TTPFRARRAR LLQRMQAAGG GVAILPTAPE RVRNRDAHYA YRHDSYFYYL 
SAFREPEAVV VLVAGKETKQ ILFCREKNEE REIWDGYRWG PEAARAAFGF DEAWTIGDLE
KRLPDYLADQ PLLWTSLGYD NDWDARVLGA LNAVRDKART GLTPPHSVRD LRAELDEMRL
VKDASELATM RQAAQISAAA HCRAMRATRP GRHEYEIEAE LLHAFRAAGS QAPAYTSIVA
GGANACVLHY VDNDQRLNDG DLLLIDAGCE LDGYASDITR TFPVSGRFSG PQRAVYELVL
AAQAAAREAT RPGAHWNQPH DAAVKVLAQG MLDLGLLQGS LDGVLENGDY RRFYMHRTGH
WLGMDVHDAG EYKLGGEWRP LVEGMVLTIE PGCYIRAAED VPEAFWNIGI RIEDDAIVTA
DGCALITEDA PKAVADIEAL MRDARHG