Gene Anae109_3949 details


Gene Information

Locus tag: Anae109_3949
Symbol:
ID: 5377868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4608552
End bp: 4610261
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 80%
IMG OID: 640845473
Product: heat domain-containing protein
Protein accession: YP_001381111
Protein GI: 153006786
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
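
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_record` and its return keys are hypothetical and not part of the database. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate and length fields of a CDS record.

    Assumes 1-based, inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    span = end_bp - start_bp + 1                # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_record(4608552, 4610261, 1710, 569)
print(result)
```

For this record all three checks hold: 4610261 - 4608552 + 1 = 1710 bp, and 1710 / 3 = 570 codons, one of which is the stop codon, leaving 569 amino acids.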


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0775778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCAGG GCGGGCGAAT GGACGGGGTG GAAGGGGCGA GGACCATCGT CGAGCCGGAG 
GAGTTCCTCC GCAGCGCGCT CGAGAAGATC GTGTTCTTCG AGTGCCGCGT CTCGCAGCTC
GAGGCCGAGC TCGCCGCGGC GCGAACCACC GCCGAGCGCG CGCGCGGCGA CGCGGCCCAG
GCGCGCCGGC GCGAGGTCGA GCTCGAGCAG GCCCTGGCGG CCGAGCGCGG CCAGCGCGGC
GACGCCGAGG CGCGCGTCGA CGAGGTGGAG GAGCGCGTGC GCCTGCTCCA GGCCGAGCGC
GAGCGGCTCC TCGGCGGGCT CGTCGAGCGC GCCCGGCTCT CCGGGGCGAC CGGCTGCGAC
GGCGCGCCGG GCCCGGAGGA GGGCGGCGCG GACCTGGCCG GCTTCATCGC CGAGCTGCGC
GCCGAGATCG AGTCGCTCCG CGCGTGGAAG GTCGCCGCCG AGGCCGCGGG CCTCGGCGAT
CCGGCGCGCT CCGCCGAGGC GCCGGCGCAC GGAGGCGTCC GCGCCGAGCG GCCGTCCTCG
CAGCGGCACG GGCACGACGC CGGCGCGGGC CCCGAGTCGG TGGCGATGGT CGCGGACCGG
TTCGCCTCCG ACGGCCGGGT CGGCCTCACC GCGCGGGACA ACGACCACAT GAAGGCGCTG
CTCGCGACGC GCGCCGATCG TGCCCTCTAC GAGCGCTCGA TGGACGACCT CTCCGCGCCC
GACGCGGGCC GGCGCCTGCG CGCGGTGCGC GCGCTCGAGG CCCTCGGCTC GAAGGCCGCC
GCGCCGCTGC TCGCCGCCGC CCTCGGCCGC GAGCCGGAGG CCGAGGTGAA GGCCGCGCTG
CTCGGCGCCC TGGCCCGGTT CAAGGAGCCG TTCGCCGCCG AGCTCGCCGT GCGCGCCCTC
GAGGACGGGC GCCCGGCGGT GCGGGTGGCG GCGCTCGAGG CCATCGTGGC GGTCGCGTCC
GGGTCGGCCC AGGAGCGGCT CTCGGCGGCG CTGCGCGACC CGAGCCCCCT CGTCCGCCGC
CGCGCGGCCC TGCTCCTCGG CTTCGCCACC GGCGACCGGG CAGAGGAGGC GCTGGCGGCC
GCGCTGGCGG ACCCGGATCG CGGCGTCGCC CGCGCCGCCG CCGCGGCGCT CTCCGGCCGC
CCGACCGCGC GCGCGCAGGG CGCCCTCGCC CGCGGGCTCG AGCACCCGGA CGCCTCGGTC
CGACGCTCCG CGGCGACCGC CCTCGGCCGG CTCGCCGGCG AGACCGTGGA CTCCGACGCG
CCCGCCTCCG CCCGCCGGGC CGCCTCGCGG CGCATCGCCG AGAAGCTCGC CGCGATGGGA
GGGGAGGAGA TCCGCGCCGC GGTCCTCGCG GCGGCGCCAG GAGCGCCCGC GACCCCGCTC
GTGGTTCGAC AGGCTCACCA CGAGCGGCCG AGAGAGTCCG CTCGCCCTGA CCCCTCGGCA
GGCTCGGGGA AAGGGCGAGC GATCGCGACC GCGTCACCCG CCGCGCTCCT CTCCCGCGCG
GCCGTGGCGG TCGTCGAGGT CGCGCCGGCG CCCGGCGAGG ATCTCTCGGC CGGGATCGTC
TTCGAGGTCC GCGCCGCCCT CCGCGGCTGC ACCGCCGCCG ACCTCTCGAA CGTGCTCGGC
GCCGAGCCGG GCCGCGTCGA TGCCGCCCTC GCCGCGCTCG CCGGGCGCGG CACCCTCGTC
CAGCGCGGCG CGCGCTGGTT CATGGCCTAG
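
The GC content field can be recomputed from a gene sequence printed in this layout (blocks of ten bases separated by spaces). This is a minimal sketch: the helper name `gc_content` is hypothetical, and the page does not say how the database itself rounds the figure.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between 10-base blocks and any line breaks)
    is removed before counting; case is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGGGCAGG GCGGGCGAAT GGACGGGGTG GAAGGGGCGA GGACCATCGT CGAGCCGGAG"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence instead of the first line gives a value that can be compared against the GC content reported in the Gene Information fields.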
 
Protein sequence
MGQGGRMDGV EGARTIVEPE EFLRSALEKI VFFECRVSQL EAELAAARTT AERARGDAAQ 
ARRREVELEQ ALAAERGQRG DAEARVDEVE ERVRLLQAER ERLLGGLVER ARLSGATGCD
GAPGPEEGGA DLAGFIAELR AEIESLRAWK VAAEAAGLGD PARSAEAPAH GGVRAERPSS
QRHGHDAGAG PESVAMVADR FASDGRVGLT ARDNDHMKAL LATRADRALY ERSMDDLSAP
DAGRRLRAVR ALEALGSKAA APLLAAALGR EPEAEVKAAL LGALARFKEP FAAELAVRAL
EDGRPAVRVA ALEAIVAVAS GSAQERLSAA LRDPSPLVRR RAALLLGFAT GDRAEEALAA
ALADPDRGVA RAAAAALSGR PTARAQGALA RGLEHPDASV RRSAATALGR LAGETVDSDA
PASARRAASR RIAEKLAAMG GEEIRAAVLA AAPGAPATPL VVRQAHHERP RESARPDPSA
GSGKGRAIAT ASPAALLSRA AVAVVEVAPA PGEDLSAGIV FEVRAALRGC TAADLSNVLG
AEPGRVDAAL AALAGRGTLV QRGARWFMA
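
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is illustrative and makes some assumptions: the names `CODON_TABLE` and `translate` are hypothetical, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Amino acid assignments in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()   # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGGGGCAGG GCGGGCGAAT GGACGGGGTG GAAGGGGCGA GGACCATCGT CGAGCCGGAG"
print(translate(first_line))  # MGQGGRMDGVEGARTIVEPE
```

The output matches the first twenty residues of the protein sequence above (MGQGGRMDGV EGARTIVEPE).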