Gene Tmz1t_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1785 
Symbol 
ID7085755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2007655 
End bp2008755 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content73% 
IMG OID643698807 
Productiron-sulfur cluster binding protein 
Protein accessionYP_002355433 
Protein GI217970199 
COG category[C] Energy production and conversion 
COG ID[COG1600] Uncharacterized Fe-S protein 
TIGRFAM ID[TIGR00276] iron-sulfur cluster binding protein, putative 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.280211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAA AATTGCCGAC GCCGGACGGG GCGGCGGTCG ACGGGACCGT CCTCGACGGT 
GCGGCACTTG TTGCGCGGAT CAGACAGTGG GGGCGCGAAC TCGGCTTCGA CGCGGTCGGC
GTGGGCGGGG TCGATCTCGC CGACGCCGAA CCCGGCCTGG TCGCCTGGCT GGAGGCCGGC
TTCCACGGGG ACATGGATTA TATGGTGCGC CACGGCATGA AACGCGCGCG CGCCGCCGAA
CTCCTGCCCG GCAGCGTGCG CGTGATCAGC GTGCGCATGG GCTACTGGCC GGATGCCGCG
CCGGCCATGG ACGTGCTCGG CGACCCCGAG CGCGCCTATG TGTCGCGCTA CGCGCTCGGC
CGCGACTACC ACAAGCTGGT GCGCAACCGC CTGCAGAAGC TCGCCGACCG CATCAGCGCG
GAGGTGCCGC ACCAGTACCG CGTGTTCACC GACTCGGCGC CCATCCTCGA AGTCGAGCAC
GCCAGTCGCA AGGGCCTGGG CTGGCGCGGC AAGCACACCC TGCTGCTCGA TCGCACCGCC
GGTTCGTGGT TCTTCCTCGG CGAGATCCTC ACCGACCTGC CGCTGCCGGT GGACGCGCCG
GTGGCGTCGC ACTGCGGGCG CTGCACGGCC TGCATCGACG CCTGCCCCAC CGGCGCCATC
GTCGCGCCCT ACCGGCTCGA CGCACGGCGC TGCATCTCCT ACCTCACCAT CGAGCTGCAC
GGCGCGATCC CCGAGGAGCT GCGCCCGTTG CTCGGCAACC GCATCTACGG CTGCGACGAC
TGCCAGCTCG TGTGCCCGTG GAACCGCTTC GCCCAGCTTG GCCGCGAGCC CGATTTCGCC
CCCCGCCAGG GCCTCGACGA CGCCCGGCTG GCCGAGCTCT TCGCGTGGAC CGCGGCGGAG
TTCTCCGAGC GCACCGCAGG CAGCCCGATC CACCGCATCG GCCACGCGCG CTGGCTGCGC
AACATCGCGG TCGCGCTCGG CAACGGCCCG GCGACGCCCG CCGCGCGCGC GGCCTTGCAG
GCGCGCGCGG ACGACGAGGA TGCGGTGGTG CGCGAGCACG TCGCCTGGGC GCTCGCCCGC
CTGGCCGCCG CCGCCGGCTA G
 
Protein sequence
METKLPTPDG AAVDGTVLDG AALVARIRQW GRELGFDAVG VGGVDLADAE PGLVAWLEAG 
FHGDMDYMVR HGMKRARAAE LLPGSVRVIS VRMGYWPDAA PAMDVLGDPE RAYVSRYALG
RDYHKLVRNR LQKLADRISA EVPHQYRVFT DSAPILEVEH ASRKGLGWRG KHTLLLDRTA
GSWFFLGEIL TDLPLPVDAP VASHCGRCTA CIDACPTGAI VAPYRLDARR CISYLTIELH
GAIPEELRPL LGNRIYGCDD CQLVCPWNRF AQLGREPDFA PRQGLDDARL AELFAWTAAE
FSERTAGSPI HRIGHARWLR NIAVALGNGP ATPAARAALQ ARADDEDAVV REHVAWALAR
LAAAAG