Gene GM21_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1223 
Symbol 
ID8136548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1429076 
End bp1431346 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content64% 
IMG OID644868837 
ProductRNA binding S1 domain protein 
Protein accessionYP_003021042 
Protein GI253699853 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.572551 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATGC ACGCAACAGA AACACCTCAA TTGCTTGAAA TCATAGTCGA AGAAAGCGGG 
CTGCCGCGCG GCGGCGTGCT CCAGACGGTG GCGCTTTTGA ACGAGGGGGG GACCGTCCCC
TTCATCGCGC GCTACCGCAA GGAGCAGACC GGGGAACTGG ACGAGGTGCA GATCCGCCAG
ATCGAGGACC TGCTGATCTA CCACCGCGAG CTGGCCGAGC GCAAGGTCAC GGTGCTGAAG
AGTATCGAGG AGCAGGGAAA ACTCACACCG GAGCTTTCCG CCCGGATCGC GAGCTGCCGC
AAGAAGACCG ACCTGGAGGA CCTTTACCTC CCCTACAAGC CGAAGCGCCG CACCAAGGCC
ACCATCGCTC GGGAGAAGGG GCTGGAGCCC TTGGCCGAGC TGATCGCGGC GCAGGAACTG
GCGACGGGAA GCCCGACCGA GGCGGCGGCC CCTTTCGTCA ACGCGGAGTT GGGCGTAACC
GACGCAGAGG CAGCCCTCGC GGGGGCGGCG CACATCATAG CCGAGCAGTT GAGCGAGAAC
GCGGACCTGC GAGGCAGGGT GCGCGACCTG ACCAGGGAGC AGGGGATATT CGGCTCGCAG
CTGATGGGCG ACGGGGCCGA CAAGGACGGG AAATTCCGGA TGTACCACGA CTACCAGGAG
CCGCTCAAGT CGATACCGTC CCACCGCATG CTCGCCATGC GGCGCGGCGA GAAGGAAGAC
GTCCTGAGGC TCGCCCTGCT GGCGCCGGAA GAGGAGATCG TCGGGAGACT GAAGGGGGCG
TTGGTCAAAC GTGAAAGCAT CTTCAAGGGC ATCCTTGAAG CGGCGGCCGC CGACGCCTAT
AAGCGGCTCA TCGCCCCGTC GATCGAGGTG GAGCTTAGGC TTGAGGCGAA GACCCGCGCC
GACGAGGCGG CCATCGCCGT CTTCGCGCAG AACCTGAAGA ACCTGTTGCT TCTCCCCCCC
GCCGGAGGGA GGCGGGTGCT CGGGATCGAC CCAGGCCTTA GAACCGGCTG CAAGCTCGCA
GCAATCAGCG AGACCGGGCG CTTCCTGGAG CATGTCACCA TCTACCCGCA CACCGGCGGA
GCCAAGGCGG AGGCTGCCGG TGCCGAGCTT GCCGGGATGG TAGAGCGCAA CGACTGCCGC
CTCATCGCCA TCGGCAACGG CACCGGAGGG CGCGAGACCG AGATCTTCGT GCGGGACGCA
TTGAGGAAGG CGGGGATAAA GGCGGAGACG GTGATGGTGA ACGAGGCCGG CGCCAGCATC
TATTCGGCAT CGGACATCGC CAGGGAAGAG TTCCCCGAAC TCGACCTGAC CGTCCGCGGC
GCCATATCCA TCGGCCGAAG GCTCCAGGAC CCTCTCGCCG AACTGGTGAA GATCGATCCC
AAGAGCATCG GGGTCGGCCA GTATCAGCAC GACGTGAACC AGACCCTGCT GAAAAAGGCG
CTGGACGCGG TGGTGGAATC CTGCGTCAAC TTCGTCGGCG TCGACTTGAA CAGCGCGTCG
TGGGCCCTGC TCTCCTACGT TGCCGGCCTC TCCGAAAGCC AGGGGCGGGC CATAGTCAGG
CACCGGGACG AACAGGGGGC CTTCGCCTCG CGGCAGTCGC TCCTCAAGGT GGCGCGCTTC
GGTCCCAAGG CGTTCGAGCA GGCAGCGGGC TTTTTGAGGA TCAGGGGTGG CGAGAACCCC
CTGGACAATA CGGCGGTACA CCCCGAGAAC TACGCGGTGG TCGAGAAGAT GGCGGCTGAC
CTCGGGGTGA GCCTGTCGCA ACTGGTGGCG GACCCGGGTC TTTCGGCCGG TATCAGGATC
GAGCGCTACG TGACCGACAC CATCGGAATC CCGACGCTGC GCGATATCCT GGCGGAGCTT
AAGAAACCGG GGCGCGACCC GCGCGAGCAG TTCCAGAGCG CGAGCTTCCG CGAGGATGTG
GTGACCATCG CCGACCTGAA GGAGGGGATG ATCCTGCAGG GGGTGGTGAC CAACGTCGCC
GCCTTCGGCG CCTTCGTGGA CATCGGGGTG CACCAGGACG GGCTCGTGCA CGTAAGCCAG
CTCACGCACA GGTTCACCAA GGACCCAAAC GATGCGGTCA AGGTGGGGCA AATCGTGAAG
GTAAAGGTGC TTTCGGCTGA TCCGGAGAGG AAGAGAATCT CGCTCTCCAT CAAGCAGGCG
GAACCCGAAA AAGCGCAGAA GAACGCGGAA GTCAGAAAGC CGCAGCCCAA AGAAAAGCCG
GTGAACGAGC AATCGGCGTG GGAAAAGGCG GGTTTCAGGG TGAAGAAGTA A
 
Protein sequence
MTMHATETPQ LLEIIVEESG LPRGGVLQTV ALLNEGGTVP FIARYRKEQT GELDEVQIRQ 
IEDLLIYHRE LAERKVTVLK SIEEQGKLTP ELSARIASCR KKTDLEDLYL PYKPKRRTKA
TIAREKGLEP LAELIAAQEL ATGSPTEAAA PFVNAELGVT DAEAALAGAA HIIAEQLSEN
ADLRGRVRDL TREQGIFGSQ LMGDGADKDG KFRMYHDYQE PLKSIPSHRM LAMRRGEKED
VLRLALLAPE EEIVGRLKGA LVKRESIFKG ILEAAAADAY KRLIAPSIEV ELRLEAKTRA
DEAAIAVFAQ NLKNLLLLPP AGGRRVLGID PGLRTGCKLA AISETGRFLE HVTIYPHTGG
AKAEAAGAEL AGMVERNDCR LIAIGNGTGG RETEIFVRDA LRKAGIKAET VMVNEAGASI
YSASDIAREE FPELDLTVRG AISIGRRLQD PLAELVKIDP KSIGVGQYQH DVNQTLLKKA
LDAVVESCVN FVGVDLNSAS WALLSYVAGL SESQGRAIVR HRDEQGAFAS RQSLLKVARF
GPKAFEQAAG FLRIRGGENP LDNTAVHPEN YAVVEKMAAD LGVSLSQLVA DPGLSAGIRI
ERYVTDTIGI PTLRDILAEL KKPGRDPREQ FQSASFREDV VTIADLKEGM ILQGVVTNVA
AFGAFVDIGV HQDGLVHVSQ LTHRFTKDPN DAVKVGQIVK VKVLSADPER KRISLSIKQA
EPEKAQKNAE VRKPQPKEKP VNEQSAWEKA GFRVKK