Gene GM21_3459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3459 
Symbol 
ID8138831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4002506 
End bp4003465 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content65% 
IMG OID644871079 
Productpseudouridine synthase, RluA family 
Protein accessionYP_003023239 
Protein GI253702050 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones105 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCGA TCGAACTCAC CTTCCCCAGC GATTCCGATC CGGAGCGGCT GGACAGCTTC 
ATATCCCGCA ACGTACCGGA GCTGACCCGC TCGGCGGCCC TAAGACTCAT AGAGACCGGG
TTCGCCACCG TCAACGGCTC GAAGCAAAAG CCCGCCTTGA AACTCAAGGG AGGCGAAGTG
GTCCAGGTGG TGGTCCCGCC TCCGGTCCCC GCCGAGCCCC AGCCCGAGGA GATCCCCATC
GAGGTGCTCT ACGAGGATGG CGAGCTGGTG GTGGTGAACA AGGGGGCGGG GATGGTGGTG
CATCCCGGCG CCGGCAACCC GGAAGGGACG CTGGTGAACG CCCTGCTGGC CCACTGCAAG
GACCTTTCCG GCATCGGCGG GGAACTGCGT CCCGGCATCG TGCACCGCAT CGACAAGGAC
ACCTCCGGCA CCCTGGTCGT CGCCAAGAGC GACCGGGCCC ACAACGCCCT GGCCGAACAG
TTCAAGGAAC ACACCATAAA AAGGGTCTAC CTCGCCCTCG TCTACGGCTC GCCCAAGGAA
GACAAGGGGA GAATCGAGTC GAGCATCGGC CGCCATCCCA CCGACCGAAA GAAGATGTCC
GCGAAGGCGC GCCACGGCAA GCAGGCGGTG ACCCACTGGC GGGTGGTGGC CCGCTACCCC
GGCATTACGC TGATACGCCT GAGGCTCGAG ACCGGCCGCA CCCACCAGAT CAGGGTGCAC
ATGTCAGAGG CCGGGCACCC GCTTTTGGCA GACGAGGTGT ACGGCGGCAC GGGAAGGCTC
TCCGGCGTGC AGGACCCGGT CCTCAAACAG ATGATCAAGT CGATGGGGCG GCAGGCGCTG
CACGCGAAGA CGCTCGGCTT CCTGCACCCG GTCTCCGGAA AGTACCTGGA GTTCGACACG
GAACTCCCCC CGGACATGGC AGGCATCGTC GCCTATCTGG AAGAAAAAAA CAGGGGCTAA
 
Protein sequence
MEPIELTFPS DSDPERLDSF ISRNVPELTR SAALRLIETG FATVNGSKQK PALKLKGGEV 
VQVVVPPPVP AEPQPEEIPI EVLYEDGELV VVNKGAGMVV HPGAGNPEGT LVNALLAHCK
DLSGIGGELR PGIVHRIDKD TSGTLVVAKS DRAHNALAEQ FKEHTIKRVY LALVYGSPKE
DKGRIESSIG RHPTDRKKMS AKARHGKQAV THWRVVARYP GITLIRLRLE TGRTHQIRVH
MSEAGHPLLA DEVYGGTGRL SGVQDPVLKQ MIKSMGRQAL HAKTLGFLHP VSGKYLEFDT
ELPPDMAGIV AYLEEKNRG