Gene GSU3373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3373 
Symbolsun 
ID2686993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3707306 
End bp3708652 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID637128067 
ProductSun protein 
Protein accessionNP_954413 
Protein GI39998462 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase
[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCAAT CCACTGCCGA CCCGCGCCGT TCGGCTTTTG ATATCCTCGT CCGCATTGAA 
CGGGAGCGAA CCTTCGCCGA ACCGCTCATC GACAGGGAAT TGTCCGGTGG CGCCCTGAAG
GGACCAGACC GGGGGCTCCT GACCGAGCTC GTCTATGGGG TGCTGCGTCG AACAGCGACC
CTCGACTACC TGGTGGACCT GTTCTGCGCC ACTCGCGCCG CCAAGCTGGA GCGATCAGTG
CTGATCCTGC TCCGTCTGGG GCTTTACCAG ATTTTCTTCC TTGACCGGAT TCCGGTGTCC
GCCGCAGTCA ACGAAACGGT AACGCTGGCC CGTGAAAAAT CACCACGGGC CAGCGGCCTC
GTAAACGCAG TCCTGCGACG GTCCGACCGG GAACGGTCAT CGATCGCCTG GCCTGACCGG
GTGCGGGACC CGGCCGGCTA CCTCGCCCTC CGTCACTCCC ATCCCCGCTG GATCGTGGAA
GGGTGGATCG CCCAGCTGGG GTTCGAAGAG GCGGAGGCGC TTGCCGAGGT CATGGCCGCG
CCCCCTCCCC TGACCCTCCG GGTCAATACC CTGCGGACCT CCCGCGAGGC ATACCTTGAA
CTCCTGCGGG AGGCAGGCAC GGAAGCGGAG CCCACGCGCC ATTCCCCCCA CGGCATCCGT
ATTCTCTCAC GGACAGCGGT GCCGGCGCTG CCGGGCTTTG GCGAGGGGCT CGTCATCGTG
CAGGACGAAT CCTCTCAACT GGCATCGCTC CTCCTGGAGC CCCGGAGCGG TGAACGGGTC
CTCGATGCCT GCGCTTCCCC CGGCGGCAAG GCGACCCACC TTGCCCAGAT CATGGCCGAC
AAGGGAGAGG TCATTGCCTG GGACGTGTCG GAGAAAAAGC TCTCTCCGAT TGCTGAAAAT
GCCCGCCGTC TGGGCATCGG CATCATTCGG CCCGCCATGG CCGATGCCCG GAATCCGGAG
CAGAACGCCG CTCCCTTCGA CAGGATTCTG GTGGACGCCC CCTGCTCGGC ACTGGGAGTG
CTGCGCCGCA CCCCTGAGGG GAAGTGGTGG AAGACCCCTG ACGACGTGGC ACGGCTGGCC
CAGAGCCAGT GCCGGATACT GGCGGGGGCC GCCTCCCTGC TGAAGCCGGG CGGCACGCTC
CTCTACTCCA CCTGCTCCAC CACAACGGAC GAAAATGAGT CAATTATCGA GGATTTCCTT
TCGCGCCGCG CCGATTTTAT GTTAGAAGAC TTGAATTATC TTTTCCCCGG CCTGTCAGAA
TGCATCACCG ACCGGGGTAT GTTCCGCAGC TGGCCCCACC GCCACGGCAT GGACGGTTTT
TTCGCCGCCC GCCTGCGTCG GGCCTGA
 
Protein sequence
MNQSTADPRR SAFDILVRIE RERTFAEPLI DRELSGGALK GPDRGLLTEL VYGVLRRTAT 
LDYLVDLFCA TRAAKLERSV LILLRLGLYQ IFFLDRIPVS AAVNETVTLA REKSPRASGL
VNAVLRRSDR ERSSIAWPDR VRDPAGYLAL RHSHPRWIVE GWIAQLGFEE AEALAEVMAA
PPPLTLRVNT LRTSREAYLE LLREAGTEAE PTRHSPHGIR ILSRTAVPAL PGFGEGLVIV
QDESSQLASL LLEPRSGERV LDACASPGGK ATHLAQIMAD KGEVIAWDVS EKKLSPIAEN
ARRLGIGIIR PAMADARNPE QNAAPFDRIL VDAPCSALGV LRRTPEGKWW KTPDDVARLA
QSQCRILAGA ASLLKPGGTL LYSTCSTTTD ENESIIEDFL SRRADFMLED LNYLFPGLSE
CITDRGMFRS WPHRHGMDGF FAARLRRA