Gene DvMF_2024 details


Gene Information

Locus tag: DvMF_2024
Symbol:
ID: 7173943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2506764
End bp: 2507945
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 73%
IMG OID: 643540541
Product: pseudouridine synthase
Protein accession: YP_002436435
Protein GI: 218887114
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID:
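
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2506764
END_BP = 2507945
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2507945 - 2506764 + 1 gives 1182 bp, and 1182 / 3 - 1 gives 393 aa, matching the listed lengths.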


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAC CGCTCGTGGT CACCGCCGCA GAGGCCGGAC AGAAGCTGGT GCAGTACCTG 
CAACGGCGCT GCGGCGCGCC GCAGTCGGCC ATCCAGCGCT GGGTGCGCAC CGGGCAGGTG
CGCATCAACG GGGGGCGCTG CAAGCCCTTT GACCGCGTGG CCGAGGGCGA CGTGGTGCGG
GTGCCGCCTT TTGCCCTGGC GGGCGGAGAG GGGGACCCGG TTGTCGGGGG GGGTGCGCCG
GATGCAGGGC AGGGTGCAAC GCGGGCTGCG TCGGCGAATG CGGGCGGAGG AAAAGAGCGG
GGCTCCGCCC CGCGCCCCGC AAGGGGACCG TATCCCCTTG ACCCCGAACC GGGTGAGGGT
GCGGCCTGTG GTACGGCAGC AGCGGGAAGC ACGGCGGCAC CAGGTCCACG GGCATGCGGC
ACACGCCCGG AATTTCAGGC ATCACCCTTG TCCGTGGCGG GCCGCGCCGA AGGGCTGCTG
GTGCTGCTGA AGCCCGCCGG GCTGGCGGTG CAGCCGGGCA CCGGCCACGA TGATTGCGTT
ACCGCCCGGC TTGCGGCGCA GTACGCCGGG GCGGACTTTC TGCCCACGCC CGCGCACCGG
CTGGACCGCG ACACCTCGGG CCTGCTGCTG GTGGCCACCA GTTATGCCCG GCTGCGGGCG
CTGTCCGACG CCTTTGCGGC GCGCGAAGGG CTGGTGAAGG AATACCTGGC CTGGGTGGCG
GGACGCTGGC CCCACGAGGG CGCGCGGACC CTGCATGACC GGCTGGAGAA GCAGGGCGCT
CCGGGCCGCC AGAAGGTGCG CCGGGTGGGC GGGGAGGGTT CCGTTCCTCG CGCGGCGTCT
GGCAATGAAG CGGTTCGCGT GGAGTCTGGT ACGGACGCGG CCCATGCCGC CGCCGGTGCT
GACGCTGGCC GCCACGCCGC CTGCACCGTC ACCCCCCTGC GGCGCGGCGA TGGGGCGTCC
CTGCTGCTGG TGCGCCTGCA CACCGGGCGC ACCCACCAGA TCCGGGTGCA GCTTGCGGAG
CGGGGACACC CCATCATGGG CGACCGCAAG TACGGTGGGC CCGCCTGTGG TCAGGGCATG
CTGCTGCACG CCGTACGCCT GACCCTGCCC GACGGCGAAC GCTTCACGGC CCTGCCGGAC
TGGACGGGCC GCTGGCAGGT GGGCGAGGGC GATCTGCCCT AG
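
The gene sequence is listed in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample input is only the first line of the listing, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence listing, copied from this record.
    first_line = (
        "ATGGCAGAAC CGCTCGTGGT CACCGCCGCA "
        "GAGGCCGGAC AGAAGCTGGT GCAGTACCTG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` would give a value to compare against the GC content field.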
 
Protein sequence
MAEPLVVTAA EAGQKLVQYL QRRCGAPQSA IQRWVRTGQV RINGGRCKPF DRVAEGDVVR 
VPPFALAGGE GDPVVGGGAP DAGQGATRAA SANAGGGKER GSAPRPARGP YPLDPEPGEG
AACGTAAAGS TAAPGPRACG TRPEFQASPL SVAGRAEGLL VLLKPAGLAV QPGTGHDDCV
TARLAAQYAG ADFLPTPAHR LDRDTSGLLL VATSYARLRA LSDAFAAREG LVKEYLAWVA
GRWPHEGART LHDRLEKQGA PGRQKVRRVG GEGSVPRAAS GNEAVRVESG TDAAHAAAGA
DAGRHAACTV TPLRRGDGAS LLLVRLHTGR THQIRVQLAE RGHPIMGDRK YGGPACGQGM
LLHAVRLTLP DGERFTALPD WTGRWQVGEG DLP
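
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python version, assuming the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The `translate` function and the table construction are illustrative, not the database's own method.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGGCAGAAC CGCTCGTGGT CACCGCCGCA"))  # MAEPLVVTAA
    print(translate("GATCTGCCCT AG"))  # DLP
```

The two fragments reproduce the first ten residues and the last three residues of the listed protein sequence.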