Gene GM21_1016 details

Gene Information

Locus tag: GM21_1016
Symbol:
ID: 8136338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1198934
End bp: 1200124
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 53%
IMG OID: 644868628
Product: putative transcriptional regulator
Protein accession: YP_003020836
Protein GI: 253699647
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
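
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1198934, 1200124)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Both results match the Gene Length and Protein Length fields listed above.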


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 121
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTCTAA ATAAACCGCT GGACCAACTG ACGGAAGCAG ACTTCCAGGA ACTCATCGCG 
AATAAAGTCC CCGAGAGCAA AACCCTCGAC TATAAGGTCG ATCTAAAGTT TGGTGACCGG
GATAAGCGGG AGTTCCTCGC CGACGTGTCG TCGTTCGCCA ACACTGCCGG CGGCCACCTG
CTCATTGGTA TCAAAGAGGA GGGCGGCATC CCGACCGCTC TCCCGGGCAT CGACCTCGAC
AACCCCGATA CGGAAAAGCT GAAACTGATT AACCTCATTC GGGACTGCAC TCAGCCGCGC
ATCCCGGGGG TTGCCATCAC ATCGGTCCCT CTCCAGAATT CCCGTTACAT CCTTGCCATC
CATATCCCGA AAAGCTGGGC AGTCCCGCAC GTAGTAAGCA TCGAGAAGCA TTGGCGCTTT
TATGCGCGGC ATTCCGGCGG CAAGTATCAG CTAGACGTCC CAGAACTGCG CCAGGCGTTC
CTCATGTCGG AGTCCCTCGC AGAAAAAATT CGCCAATTTC ACAGCGAACG AGTGGGCATG
GTGATATCCG GCGAAGTACC ACTAAACCTT GCCAACGGGC CGAAGTTCAT TGCCCACATC
ATACCGGTGG ACGCATTTGG GTCAGGACAA CAAGTGGATA TGTCGATGCT GATGGAGAGG
GGCATCCACT TCAATCCGCT GGGCGCCTCT GGCTATAACC GTAGATACAA CCTGGACGGC
TACCTTACGT ATGAAGAGGA AAGAACCCAG GATGCATCCC ATGCGTTGTC CTACACCCAG
CTATTCCGTT CCGGCATTAT CGAATCGGTC TGTGTTGACA AGGACCACCT AAATGCCAAT
GAACGCGATA GAGGCATCCC CATCACTTAC TATCAAGAGC AATTGCTGCG GTTCTTGTCC
GCTTCCCTGC AATCATTGAA ACAACTCGAG GTGGAGCCAC CCTATTCAAT GATGGTCACC
ATGGCCGGCG TGAAACACAG GTATTTGCAT TTCGGCAATA GGTACTTCTC GCTCAGGAAT
CCCTATATTG ATAGGGATGT GCTGCAATTG CCCGACATCC TTATTCAGGA CGCTGACTTC
GCCGGCGGAA AGACAATGCG ACCGATATTC GACGCCATCT GGAATGCCGG AGGGCTTGAA
AGGTGCTTCG ATTACGACGA AGAAGGCAGG TGGAACGGCT ATGGCCAATA A
 
Protein sequence
MSLNKPLDQL TEADFQELIA NKVPESKTLD YKVDLKFGDR DKREFLADVS SFANTAGGHL 
LIGIKEEGGI PTALPGIDLD NPDTEKLKLI NLIRDCTQPR IPGVAITSVP LQNSRYILAI
HIPKSWAVPH VVSIEKHWRF YARHSGGKYQ LDVPELRQAF LMSESLAEKI RQFHSERVGM
VISGEVPLNL ANGPKFIAHI IPVDAFGSGQ QVDMSMLMER GIHFNPLGAS GYNRRYNLDG
YLTYEEERTQ DASHALSYTQ LFRSGIIESV CVDKDHLNAN ERDRGIPITY YQEQLLRFLS
ASLQSLKQLE VEPPYSMMVT MAGVKHRYLH FGNRYFSLRN PYIDRDVLQL PDILIQDADF
AGGKTMRPIF DAIWNAGGLE RCFDYDEEGR WNGYGQ
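
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage helper. The function names are hypothetical, and the method the database used to compute its GC content field is not stated in the record, so this may not reproduce that value exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First displayed line of the gene sequence from this record.
first_line = "ATGTCTCTAA ATAAACCGCT GGACCAACTG ACGGAAGCAG ACTTCCAGGA ACTCATCGCG"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_percent(seq), 1))
```

To process the whole gene, pass all of the gene sequence lines as a single string to `clean_sequence` and then to `gc_percent`.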