Gene Acid345_3345 details


Gene Information

Locus tag: Acid345_3345
Symbol:
ID: 4071263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3967692
End bp: 3968693
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 59%
IMG OID: 637985367
Product: ribosomal large subunit pseudouridine synthase D
Protein accession: YP_592420
Protein GI: 94970372
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
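
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are illustrative, and it assumes inclusive 1-based coordinates and a CDS whose final codon is a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3967692, 3968693)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1002 bp and 333 aa, in agreement with the Gene Length and Protein Length fields.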


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCG CACAACACAT CCAGGTCAGC GCAGACGACG CGAACATTCG CCTGGATCAA 
TATCTCGTCT CGCATCTCCC CGACGTCTCA CGCGCTCGCG TACAGGCGCT GATCGACGAC
GAAAAGATCC TGGTAGACGG AAAGTCCTCC AAGCCGTCCT ATAAGCTGCG CGGAAGCGAA
GTGATCGATG TCGTCGGCGA ATATCAGCCG CCGCCGTTGC GCGCGATTCC CGAAGATATT
CCGCTCGATG TGGTGTACGA AGACGATGAT CTCGCGGTCA TCAACAAGCC GGCAGGAATG
ATGGTGCATG TTGGCGCTGG TGCAACGGAG GAAGAGCGTA ATCGCGGGAC GCTGGTGAAT
GCGCTGCTGT ATCGGTTCCG AGCGCTGTCA GAAGTCGGCG GCGACATGCG GCCCGGTATC
GTTCACCGCT TGGACAAAGA GACCAGCGGA CTGATCGTGG TTGCGAAGAA CGACGTTGCG
CACCGCAAGC TCGCGGAACA GTTTTCCTCG CGACGGGTTC ATAAGAAGTA CGTGGCTCTG
GTGCATGGAT GGCCGAAGAA ACTGAAGGGA ACCATCAACC TGCCGATCGC GCGCGACATG
TCGCGCCGCA CGCGGATGAC GACCCGCGGG TCAGGTGGAC GCGATGCGCT GAGCCACTAC
GAAGTGAAGG AGAAGATCGA GTCTCCGTAC GGCAAGTTCG CGCTGGTCGA GGTGAAGATC
GAGACCGGCC GCACCCACCA GATCCGCGTG CATATGGCCA GTTTGGGCCA TCCGGTGGTG
GGCGACACGC TCTACGGTGC GCCGGGTGAG TTGCGGGTTA CGAAGGCGTT GAAAGGGATG
CCATCGAAGA TGGCGTCCCT GGAGCGAAAT TTCCTCCACG CGGCAGAAAT CGAATTGCAG
CAGCCCGCAA CGGGAAAAGC ACTGCGTTTT GTAACCAAAG TTCCAGCGGC ACTAGAGGAT
TTCGCTGAGA CATTGCGGCA TCCGGAAGTA CGCGGGACGT AG
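
The GC content reported in the gene information can be recomputed from the gene sequence. The helper below is a minimal sketch under stated assumptions: `gc_percent` is a hypothetical name, whitespace between the 10-base blocks is ignored, and any character other than A, C, G or T is skipped rather than counted.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only unambiguous bases; spaces, newlines and N are dropped.
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Toy input: two of the four bases are G or C.
print(gc_percent("ATGC"))
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the rounded percentage in the record.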
 
Protein sequence
MDGAQHIQVS ADDANIRLDQ YLVSHLPDVS RARVQALIDD EKILVDGKSS KPSYKLRGSE 
VIDVVGEYQP PPLRAIPEDI PLDVVYEDDD LAVINKPAGM MVHVGAGATE EERNRGTLVN
ALLYRFRALS EVGGDMRPGI VHRLDKETSG LIVVAKNDVA HRKLAEQFSS RRVHKKYVAL
VHGWPKKLKG TINLPIARDM SRRTRMTTRG SGGRDALSHY EVKEKIESPY GKFALVEVKI
ETGRTHQIRV HMASLGHPVV GDTLYGAPGE LRVTKALKGM PSKMASLERN FLHAAEIELQ
QPATGKALRF VTKVPAALED FAETLRHPEV RGT
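
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this with a hand-built codon table; the names `CODON_TABLE` and `translate` are illustrative. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGACGGCG CACAACACAT CCAGGTCAGC"))  # MDGAQHIQVS
```

The output matches the first ten residues of the protein sequence; the final codon of the gene, TAG, is a stop codon and is not represented in the protein.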