Gene Tmz1t_3000 details


Gene Information

Locus tag: Tmz1t_3000
Symbol:
ID: 7874390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3251556
End bp: 3252989
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 73%
IMG OID: 643699921
Product: AMP-dependent synthetase and ligase
Protein accession: YP_002889976
Protein GI: 237653662
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID:
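
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported 1434 bp) and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3251556
END_BP = 3252989


def gene_length_bp(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1434 477"
```

Both values match the Gene Length and Protein Length fields in the record.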


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCG CCGAATGGCT GCTCGCCGGC GATCCCGCGC GGATCGCGCT GATCGAGGGC 
GATGTCCGCA TCGACTACGC CGGCCTGCGT GCCGATTCCA TCCGCAGCGC CGCGGCCCTG
CTCGACGCCG GGCTGCGCCC CGGCGACGCC TGCGTGCTCG CGCTGCCCGA CGGCATCGAA
TGGGCCGCCG CCTTCATCGG CATGCTGTGG GCGGGGGTGC GGCCGATCGC GATCAACCCG
CGCACCGCCA CCACGCAACT CGCCGACCTG ATGCTGGACT CGGGCGCCGC CGCGGCGCTG
CTCGAGGACG AAGCCGCGCG TGCGCTCGGC GACAAGCGCG CGATCGACCT CGGCGAATGG
CGCCGCCGGG TGGAGCGCGC CACCGCGACG CCCGCAGCCG CGCAGGCAGC CGACGACGAT
CCGGCGTTCC TGCTGTACTC CTCCGGCACC ACCGGCCGCC CCAAGGGCAT CCTCCATGCC
CACCGCGCGA TCCGTTATGC CCACGTGTTC GCCCGCGACC TTCTCGGTGC GCGCCCGGAG
CACCGCTTCT ATTCCAGCTC CAAGCTGTTC TTCGCCTACC CGCTCGCCAA CGCCTTCTTC
GCCGGCCTGC GCCTGGGCGC CACCGTCGTG CTCGACCCCG AATGGCCCGA CCCGGCGCGC
GTCGCGGCGA TGGTCGAGCG CCACGAGCCG CACATCTTCT TCAGCGTGCC CACCCTCTAC
CGCCGCCTGA TCGATGCCGG CGTGCGTTTC CGCGGCGTGC ACGCCGCGGG CTCCGCCGGT
GAAGCCTGTC CGCCCGCGCT TGCGCGCGAC TGGCAGGCGA TGACCGGAGT GCCGCTGGTC
AATGGCTACG GCACCACCGA GACGCTGTCG CTGGTGCTCT ACCGCACGCC GGAGATGGAC
GCCGCGTGCC CCACTCCACT CACCGAGATC CATCCGGAGC AGCTCACCAG CGGCGAGCTC
GAGACCTGGC GGCTGTGGTT CTCCCACCCC GCAGTCGCGC TCGGTTATAC GCGCGTGGTC
ACCCACGACA GCGCGCGCTT CGCCGACGGC CGCTTCGCCC CCGGCGACGT GTTCCGCCGT
GCCCCCGACG GGGAAGGCTG GCTGTTCGCC GGCCGCAGCG ACCAGCTCGT CAAGGTGTTC
GGCCGCTGGG TGGACGTGGT CGCGGTCGAG CAGGCCGTGC AGGAACGCAT GCGCGGCAAG
GCCGAGGAGG TGTGCGTGAT CCCGGCGCAG GGCGAGGACG CGGACATGAT CCGCCTGCAC
CTCTTCGCCA TTCCCGGCGA CCTGCCGCCA CCGCAGGTGC TGGCCGCCGC GCAGGCCGCG
ATCGAGAGCC TGCCGCCCTA CCAGCGCCCC GAGAAGATCC ACCTCGTGGA CCACTTCCCG
CGCACCGACA CCGGCAAGCT GCGCCGCAAC GAGCTCGCGC GCAGCACCGG CTGA
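
The gene sequence is printed in space-separated groups of 10 bases, so it has to be joined before any computation. As a minimal sketch, the GC content field could be recomputed as follows. The helper names are hypothetical, and only the first row of the sequence is used here for brevity; pasting in the full block would give the gene-wide figure.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, copied verbatim.
first_row = "ATGAACGCCG CCGAATGGCT GCTCGCCGGC GATCCCGCGC GGATCGCGCT GATCGAGGGC"
print(len(clean_sequence(first_row)), round(gc_fraction(first_row), 2))
```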
 
Protein sequence
MNAAEWLLAG DPARIALIEG DVRIDYAGLR ADSIRSAAAL LDAGLRPGDA CVLALPDGIE 
WAAAFIGMLW AGVRPIAINP RTATTQLADL MLDSGAAAAL LEDEAARALG DKRAIDLGEW
RRRVERATAT PAAAQAADDD PAFLLYSSGT TGRPKGILHA HRAIRYAHVF ARDLLGARPE
HRFYSSSKLF FAYPLANAFF AGLRLGATVV LDPEWPDPAR VAAMVERHEP HIFFSVPTLY
RRLIDAGVRF RGVHAAGSAG EACPPALARD WQAMTGVPLV NGYGTTETLS LVLYRTPEMD
AACPTPLTEI HPEQLTSGEL ETWRLWFSHP AVALGYTRVV THDSARFADG RFAPGDVFRR
APDGEGWLFA GRSDQLVKVF GRWVDVVAVE QAVQERMRGK AEEVCVIPAQ GEDADMIRLH
LFAIPGDLPP PQVLAAAQAA IESLPPYQRP EKIHLVDHFP RTDTGKLRRN ELARSTG
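
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names are illustrative, and only the first and last rows of the gene sequence are translated.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a grouped nucleotide block, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied verbatim.
first_row = "ATGAACGCCG CCGAATGGCT GCTCGCCGGC GATCCCGCGC GGATCGCGCT GATCGAGGGC"
last_row = "CGCACCGACA CCGGCAAGCT GCGCCGCAAC GAGCTCGCGC GCAGCACCGG CTGA"
print(translate(first_row))  # MNAAEWLLAGDPARIALIEG
print(translate(last_row))   # RTDTGKLRRNELARSTG
```

Both outputs agree with the start and end of the protein sequence listed above.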