Gene GM21_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3594 
Symbol 
ID8138967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4172346 
End bp4173692 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID644871214 
Productsun protein 
Protein accessionYP_003023373 
Protein GI253702184 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases
[COG0781] Transcription termination factor 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase
[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones140 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCAGCA AAAACCCGCG CCGCGCCGCC TTCGACATCC TGCTCCGGAT CGAGAAAGAA 
AAATCCTTCG CAGACATCCT GATTGACCAC GAACTCTCCA AGGACATCAT CAAGGGAGCC
GACCGCGGCC TGCTCACCGA GCTGGTCTAC GGCGTGCTGC GCAGGCAGGG AACGCTCGAT
TACATCATCT CCCAGTTTTC CAAGCAAAGG CCGGAGAAGC TTGAGCTTTT CGTGCGGCTC
CTGTTGCGCC TGGGGATTTA CCAGTGCTTC TTCCTGGACC GGGTCCCGGT GTCGGCCGCC
GTCAACGAGA CGGTGAACCT GGCCAAGGAA CTGGCGCCGC GCGCCTCCGG CTTCATCAAC
GCGGTCCTTA GAAACGCCGA CCGCGGCCGC GACACCATAA CCTACCCCGA CCGCGCCGCG
CGCCCCGCCG AATACCTCGC CGCGCGCTAT TCCCATCCCG CCTGGCTCGC CCAGCAGTGG
TGCGACCAGC TAGGGCTGGA AGCCGCGGAG GAGCTGGCCG CCGCCATGTC CGAACCGCCC
CCCTTGACCG TGAGGGTCAA CACGCTGCGC ATCACCCGCG AAGAGCTGAT CCGGAGGTTG
GTCGGGGAGG GGGTAAGCTG CAGCGCGACC TCGTGGTCCC CGGACGGCAT CCGCCTGAAC
CAGTCCGGGC AGATCACCAG GCTTCCCTCC TTCAGGGACG GCCTCTTCAC GGTGCAGGAC
GAATCCTCGC AACTGGCCCC GCTGTTCCTG GCGCCTGGGA AGGGGGAGCG GGTGCTGGAC
GCCTGCGCCG CTCCCGGCGG CAAGACTACC CAGATAGCAC AGCTGATGCA GGACTCGGGC
GAGATCTATG CCTGCGACGT GAACAACAAG AAGCTCCGGC TGATCAAGGA GACCTGCGAC
CGGCTGGGTA TCAACTCGGT CCGCACCTTC ACCATGGACG CCACCGCACC CTCCAACGCC
ATCAAGGAGA CCACCTTCCA CCGCATCCTG GTGGACGCCC CCTGCTCCGG CCTCGGCGTG
ATCAGGCGCA ACCCGGAGGG GAAGTGGAGC AAGTCCGGCG ACGACCTCTT GCAACTGGCG
CGCACCCAGG TCAGCATCCT GGAGAACCTC TGCAGGTACC TGGAACCGAA GGGGACCATC
CTCTACGCCA CCTGCTCGAC CAGCATCCAG GAGAACGAGT ACGTGGTGGA CAGCTTCCTC
GGAAGCCACC CGGAGTTCGT CGTGGAAGAC CTGCGCCCGC TCTTCCCTCA GTATGCGCCG
CTGTTCACCG AGCGCGGCTT CTTCAGGAGC TGGCCGCACC GCGACGGCAT GGACGGCTTC
TTTTCGGCGC GCCTGAAGAG GAAGTAG
 
Protein sequence
MSSKNPRRAA FDILLRIEKE KSFADILIDH ELSKDIIKGA DRGLLTELVY GVLRRQGTLD 
YIISQFSKQR PEKLELFVRL LLRLGIYQCF FLDRVPVSAA VNETVNLAKE LAPRASGFIN
AVLRNADRGR DTITYPDRAA RPAEYLAARY SHPAWLAQQW CDQLGLEAAE ELAAAMSEPP
PLTVRVNTLR ITREELIRRL VGEGVSCSAT SWSPDGIRLN QSGQITRLPS FRDGLFTVQD
ESSQLAPLFL APGKGERVLD ACAAPGGKTT QIAQLMQDSG EIYACDVNNK KLRLIKETCD
RLGINSVRTF TMDATAPSNA IKETTFHRIL VDAPCSGLGV IRRNPEGKWS KSGDDLLQLA
RTQVSILENL CRYLEPKGTI LYATCSTSIQ ENEYVVDSFL GSHPEFVVED LRPLFPQYAP
LFTERGFFRS WPHRDGMDGF FSARLKRK