Gene Tmz1t_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1476 
Symbol 
ID7083559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1644345 
End bp1645355 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content61% 
IMG OID643698494 
Productprotein of unknown function DUF1214 
Protein accessionYP_002355131 
Protein GI217969897 
COG category[S] Function unknown 
COG ID[COG5361] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAT ACCTGATGAT CGCCGCAGTC ACTGGAATCA TGGGACTCGA CCCAGGCGCC 
GGTTTCGCGG CGGAGAAGGT CACGGTGGAC AGCTTCGTGC GCGCCGAGAC CGACATGACG
CTCGACCGCT ACGTCAGGCA GGGCGCGCTG GGCAAACTCA TCCACATCCG CATGCCCGTG
CCGATCGACA GGCAGGACGT GATTCGCATG AATCGCGACA CGCTGTATTC CGCCGGCGTC
TTCGACCTCT CCGCACCGGT CACCATCGTC AAGCCGGAAA CCGGGGGGCG ATTCCAATCG
ATGCTGGTCA TCAACCAGGA CCACTCGATG CTGCCCGCAG AGCACGGCGC GGGCGAGTTC
ACCTTCACCC AGGAGAAGAT GGGCACGCGC TACATGATCG TGCTCTTCCG CACCTTCGTC
GATTCAAACG ACCCGACTGA TATCAAAGCG GCCAACGCCC TGCAGGACAA GATCGTGGTG
AAGCAGGCGG CCCCAGGGAA GTTCGAGATT CCGGAATGGG ATGAGGCCTC ACTGAAGAAG
GTCCGCGATG CCATCAACGT TCTGGCGGCG ACCCGGACCA GCGCCAAGGG CATGTTCGGT
GACAAGGCCA AGCTCGATCC GATCAGCCAC CTGCTCGGCA CGGCCTTCGG CTGGGGCGGG
AATCCGGAAG AAGCCGCCAT TTACGACAAC GTTGTGCCTG CGGAGAACGA CGGCAAGACG
CCCCATTCGG TCACGGTCAA GGACGTGCCT GTCGATGGCT TCTGGTCCAT CACCGTTTAC
AACAAAGACG GCTTCATGGA GAAGAACGAC CAGAACGTCT ACTCGCACAA CAACGTGACG
GCCAAGAAGA ACCAGGACGG GAGCGTGACC ATCCACTTCG GCGCTGGCAC CGATGCGCTC
AACAATGTGC CGATCACCCC GGGCTGGAAC TACATCGTCC GCATGTATCA GCCGCGCAAG
GAAATCATCG ACGGCACCTG GAAGTTCCCG GTCGCCCAAC CGACGAAGTA G
 
Protein sequence
MNRYLMIAAV TGIMGLDPGA GFAAEKVTVD SFVRAETDMT LDRYVRQGAL GKLIHIRMPV 
PIDRQDVIRM NRDTLYSAGV FDLSAPVTIV KPETGGRFQS MLVINQDHSM LPAEHGAGEF
TFTQEKMGTR YMIVLFRTFV DSNDPTDIKA ANALQDKIVV KQAAPGKFEI PEWDEASLKK
VRDAINVLAA TRTSAKGMFG DKAKLDPISH LLGTAFGWGG NPEEAAIYDN VVPAENDGKT
PHSVTVKDVP VDGFWSITVY NKDGFMEKND QNVYSHNNVT AKKNQDGSVT IHFGAGTDAL
NNVPITPGWN YIVRMYQPRK EIIDGTWKFP VAQPTK