Gene GM21_2518 details


Gene Information

Locus tag: GM21_2518
Symbol:
ID: 8137860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2946387
End bp: 2947397
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 65%
IMG OID: 644870127
Product: transcriptional regulator, AraC family
Protein accession: YP_003022317
Protein GI: 253701128
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
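
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, and the function names are hypothetical, not part of any database tool.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length_bp: int) -> int:
    """Protein length implied by a CDS length, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = cds_length_bp(2946387, 2947397)
protein_len = expected_protein_length_aa(gene_len)
print(gene_len, protein_len)  # prints: 1011 336
```

Both results agree with the Gene Length and Protein Length fields listed in the record.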


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value: 0.179818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGC CAACGACCAT ATCCGACAGA GAGAGTACCG GGCGGCGCAG GATCGCGGTA 
GCGGCATACG AAGGTGCCGA GCTCTTGGAC GTCACCGGCC CCATTGAAGT CTTCAACATG
CTGAACCGTT GCCTCGGAGA GGTAGAGGCC CTTGAGCGCG GCTACAACGT GCTCTTGATG
GCGCAGCAGC CCGGACCCTT CGCTTCTTCG CCGGGGATAA AGCTGGTGGC GGACCTTGCC
TGGCAGGAGC TTACAGCCGG CACGGACTCC ATCTTCGTGC CGGGAAGCCC TGATGACGCT
CTGGCGAAGG CTCTGAAAAA CGAGCCGCTG GTGGAGTGGC TGCGCTCGAC CCCAACGCTC
GCCAAGCGCG TGGTTTCGGT CTGTACCGGC GCCTTTCTAC TAGCCAAGGC GGGGCTCCTC
GACGGGCGGC GCGCCACAAC CCACTGGATG GACCTGGAGC GGTTGGCCCG GGAATACCCG
CAGGTCATGG TGGAGCAGGA CGCCATCTAC ATACGGGACG GGGAGATCGC CACCTCGGCC
GGGGTCACCG CCGGGATGGA TCTGGCCCTG GCGCTGGTCG AGGAGGATTT CGGCCGGAAG
ATGGCGCTCA CGGTGGCCCG GCGCCTGGTC ATGTTCCTGA AGAGGCCGGG GGGGCAGGCG
CAGTTCAGCA CCCAGCTGCG GGCCCAGATG GTGGAAGGGG GGCAGCTCGC CACCCTGCTC
GCATGGATTA AGGATAATCA CTGCCGCAAG GTCACGGTGG AAGAGCTGGC CGGGCGGGCG
GCCATGAGCC CGCGCAATTT CGCCAGGGTC TTCCTGCGGG AGACGGGAAA GACTCCGGCC
CGGTATCTAG ACCAACTGCG TCTGGAGCGC TCGATAAACC TGATGGAGGA CGGCGCGCTC
TCCCTGGACA GGGTCGCCGC CGAGAGCGGT TTCACCTGCG CCGAACAGAT GAGGCGGGTC
TTTATCCGCG AGATGGGGGT AACCCCTCTT GCGTACCGGA CGAGGTTTTG A
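
A GC content figure such as the one in the record can be recomputed from a sequence printed in this space-separated, 10-base block layout. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper names are hypothetical, and the database may round or compute its value differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first two blocks of the gene sequence above; paste the full
# sequence in place of this fragment to recompute the record's figure.
fragment = "ATGAAAAAGC CAACGACCAT"
print(f"{gc_fraction(fragment):.0%}")
```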
 
Protein sequence
MKKPTTISDR ESTGRRRIAV AAYEGAELLD VTGPIEVFNM LNRCLGEVEA LERGYNVLLM 
AQQPGPFASS PGIKLVADLA WQELTAGTDS IFVPGSPDDA LAKALKNEPL VEWLRSTPTL
AKRVVSVCTG AFLLAKAGLL DGRRATTHWM DLERLAREYP QVMVEQDAIY IRDGEIATSA
GVTAGMDLAL ALVEEDFGRK MALTVARRLV MFLKRPGGQA QFSTQLRAQM VEGGQLATLL
AWIKDNHCRK VTVEELAGRA AMSPRNFARV FLRETGKTPA RYLDQLRLER SINLMEDGAL
SLDRVAAESG FTCAEQMRRV FIREMGVTPL AYRTRF
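
The gene and protein sequences can be checked against each other by translating the CDS. The sketch below is a hedged illustration: it builds the codon table by hand from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits). The function names are hypothetical, and alternative start codons are not handled specially, which does not matter for a CDS beginning with ATG.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw_dna: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the start of the gene sequence above.
print(translate_cds("ATGAAAAAGC CAACGACC"))  # prints: MKKPTT
```

Translating the full gene sequence and comparing the result with the protein sequence, after removing the spaces from both, would confirm that the two listings are consistent.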