Gene Anae109_3167 details


Gene Information

Locus tag: Anae109_3167
Symbol:
ID: 5375063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3712227
End bp: 3713285
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 74%
IMG OID: 640844691
Product: helix-turn-helix domain-containing protein
Protein accession: YP_001380347
Protein GI: 153006022
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
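
The coordinates and lengths in this table are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive (the page does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 3712227
end_bp = 3713285

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1059
print(protein_length)  # 352
```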


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGC ACACCGTCTC CATGCATTTC GTGGGCGCCG CCGTCGCCGG GCTCTCGGGA 
GAGGCGCGCG CGCGGGTGCT GGCGTCCGCC GGCATCCCCT CGGAGCTCCT CGCGGCATCC
CACGCGCGGG TGCCCGCCGA GTCCTTCTCG GCCCTGTGGC TCGCCGTCAA TCGCGAGCTC
GACGACGAGT TCTTCGGCCT CGATCGGCGG CGGATGAAGT GCGGCAGCTT CGCCCTGCTG
TGCCACGCGG TGCTGCACGC CGGGAGGCTC GACCGCGCGC TGCGGCGGAT GCTGAGGGGG
TTCGCGGCGT TCCTGGACGA CGTCCAGGCG GAGCTGCGCG TGGACGGGCC GGACGCGGTC
GTCGCGGTCA CGAACCGCAT CGAGGCCGCC CAGGCTCGCC GCTTCGCGGA CGAGACCTTC
CTCATCATGG TGCACGGGCT GATGTGCTGG CTGGCGGGGC GGCGGATCCC GCTCACGATG
GCGGAGTTCG CGCACCCGCG GCCCACCCAC GCGCAGGAGT ACACCGTCAT GTACTCGCAG
CGGCTGCGGT TCGACGCGGA GCGCACGGCG GTCCGGTTCG ACGCGCAGCT CCTCGCGTTG
CCCGTCGTGC AGAACGCCAC CGCCCTGAAG ACGTTCCTGC GCACCGCGCC GCAGTCGGTG
TTCCTCAAGT ACACGAACGA GGACAGCTGG ACGGCCCGGC TGCGCCGGCG CCTGCGCGGG
AGCATCGGCC GCGAGGAGTG GCCCCGGCTC GAGGACGTGG CGCGCGAGTT CCACGTCGCG
CCGACGACGC TCCGCCGCAG GCTCGACGCG GAGGGGACGA GCTACCAGGG CATCAAGGAC
GAGCTGCGCC GGGACGCGGC CGTCCATCAC CTGTGCGGCA GCCGCCTGAG CGTCGCCGAG
ATCGCCGCCT CCCTCGGCTT CCAGGAGACG AGCGCGTTCC ACCGCGCGTT CAAGCGCTGG
AGCGGCGTGC AGCCCGGGGA GTACCGCAGG CGGCAGGCCG AGCTCGGGCC GGGGCGGGCG
GACGACGCGC CGCCCCCGCC CGCCCTGGCG CGAGGCTGA
 
Protein sequence
MQKHTVSMHF VGAAVAGLSG EARARVLASA GIPSELLAAS HARVPAESFS ALWLAVNREL 
DDEFFGLDRR RMKCGSFALL CHAVLHAGRL DRALRRMLRG FAAFLDDVQA ELRVDGPDAV
VAVTNRIEAA QARRFADETF LIMVHGLMCW LAGRRIPLTM AEFAHPRPTH AQEYTVMYSQ
RLRFDAERTA VRFDAQLLAL PVVQNATALK TFLRTAPQSV FLKYTNEDSW TARLRRRLRG
SIGREEWPRL EDVAREFHVA PTTLRRRLDA EGTSYQGIKD ELRRDAAVHH LCGSRLSVAE
IAASLGFQET SAFHRAFKRW SGVQPGEYRR RQAELGPGRA DDAPPPPALA RG
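
The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the pipeline that produced this page: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the standard assignments are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAAAAGC ACACCGTCTC CATGCATTTC"
print(translate(first_blocks))  # MQKHTVSMHF
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and GC content reported on this page.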