Gene Information |
Locus tag | GM21_3594 |
Symbol | |
ID | 8138967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4172346 |
End bp | 4173692 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871214 |
Product | sun protein |
Protein accession | YP_003023373 |
Protein GI | 253702184 |
COG category | [J] Translation, ribosomal structure and biogenesis; [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases; [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase; [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
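The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4172346
end_bp = 4173692

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue; the final stop codon does not encode an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1347 448
```

Both results match the Gene Length (1347 bp) and Protein Length (448 aa) fields.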
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 140 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCAGCA AAAACCCGCG CCGCGCCGCC TTCGACATCC TGCTCCGGAT CGAGAAAGAA AAATCCTTCG CAGACATCCT GATTGACCAC GAACTCTCCA AGGACATCAT CAAGGGAGCC GACCGCGGCC TGCTCACCGA GCTGGTCTAC GGCGTGCTGC GCAGGCAGGG AACGCTCGAT TACATCATCT CCCAGTTTTC CAAGCAAAGG CCGGAGAAGC TTGAGCTTTT CGTGCGGCTC CTGTTGCGCC TGGGGATTTA CCAGTGCTTC TTCCTGGACC GGGTCCCGGT GTCGGCCGCC GTCAACGAGA CGGTGAACCT GGCCAAGGAA CTGGCGCCGC GCGCCTCCGG CTTCATCAAC GCGGTCCTTA GAAACGCCGA CCGCGGCCGC GACACCATAA CCTACCCCGA CCGCGCCGCG CGCCCCGCCG AATACCTCGC CGCGCGCTAT TCCCATCCCG CCTGGCTCGC CCAGCAGTGG TGCGACCAGC TAGGGCTGGA AGCCGCGGAG GAGCTGGCCG CCGCCATGTC CGAACCGCCC CCCTTGACCG TGAGGGTCAA CACGCTGCGC ATCACCCGCG AAGAGCTGAT CCGGAGGTTG GTCGGGGAGG GGGTAAGCTG CAGCGCGACC TCGTGGTCCC CGGACGGCAT CCGCCTGAAC CAGTCCGGGC AGATCACCAG GCTTCCCTCC TTCAGGGACG GCCTCTTCAC GGTGCAGGAC GAATCCTCGC AACTGGCCCC GCTGTTCCTG GCGCCTGGGA AGGGGGAGCG GGTGCTGGAC GCCTGCGCCG CTCCCGGCGG CAAGACTACC CAGATAGCAC AGCTGATGCA GGACTCGGGC GAGATCTATG CCTGCGACGT GAACAACAAG AAGCTCCGGC TGATCAAGGA GACCTGCGAC CGGCTGGGTA TCAACTCGGT CCGCACCTTC ACCATGGACG CCACCGCACC CTCCAACGCC ATCAAGGAGA CCACCTTCCA CCGCATCCTG GTGGACGCCC CCTGCTCCGG CCTCGGCGTG ATCAGGCGCA ACCCGGAGGG GAAGTGGAGC AAGTCCGGCG ACGACCTCTT GCAACTGGCG CGCACCCAGG TCAGCATCCT GGAGAACCTC TGCAGGTACC TGGAACCGAA GGGGACCATC CTCTACGCCA CCTGCTCGAC CAGCATCCAG GAGAACGAGT ACGTGGTGGA CAGCTTCCTC GGAAGCCACC CGGAGTTCGT CGTGGAAGAC CTGCGCCCGC TCTTCCCTCA GTATGCGCCG CTGTTCACCG AGCGCGGCTT CTTCAGGAGC TGGCCGCACC GCGACGGCAT GGACGGCTTC TTTTCGGCGC GCCTGAAGAG GAAGTAG
|
Protein sequence | MSSKNPRRAA FDILLRIEKE KSFADILIDH ELSKDIIKGA DRGLLTELVY GVLRRQGTLD YIISQFSKQR PEKLELFVRL LLRLGIYQCF FLDRVPVSAA VNETVNLAKE LAPRASGFIN AVLRNADRGR DTITYPDRAA RPAEYLAARY SHPAWLAQQW CDQLGLEAAE ELAAAMSEPP PLTVRVNTLR ITREELIRRL VGEGVSCSAT SWSPDGIRLN QSGQITRLPS FRDGLFTVQD ESSQLAPLFL APGKGERVLD ACAAPGGKTT QIAQLMQDSG EIYACDVNNK KLRLIKETCD RLGINSVRTF TMDATAPSNA IKETTFHRIL VDAPCSGLGV IRRNPEGKWS KSGDDLLQLA RTQVSILENL CRYLEPKGTI LYATCSTSIQ ENEYVVDSFL GSHPEFVVED LRPLFPQYAP LFTERGFFRS WPHRDGMDGF FSARLKRK
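The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with TTG, an alternative start codon in translation table 11) and uses the table 11 codon assignments, which match the standard genetic code for amino acids but allow additional start codons. The function name `translate_table11` is hypothetical, and only the first and last few codons of the record are used.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str, is_gene_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_gene_start and codon in START_CODONS:
            protein.append("M")  # any valid start codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
head = translate_table11("TTGTCCAGCA AAAACCCGCG CCGCGCCGCC")
# Last six codons of the gene sequence, ending in the TAG stop codon.
tail = translate_table11("CGCCTGAAGA GGAAGTAG", is_gene_start=False)
print(head, tail)  # MSSKNPRRAA RLKRK
```

The two fragments reproduce the start (MSSKNPRRAA) and end (RLKRK) of the listed protein sequence.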