Gene Phep_3854 details


Gene Information

Locus tag: Phep_3854
Symbol:
ID: 8254988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4624309
End bp: 4625355
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 45%
IMG OID: 644937518
Product: pseudouridine synthase
Protein accession: YP_003094107
Protein GI: 255533735
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
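
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4624309
end_bp = 4625355

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1047
print(protein_length)  # 348
```

Under these assumptions both results agree with the reported Gene Length (1047 bp) and Protein Length (348 aa).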


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATT CTACCACCCG ATTAAATAAA TACATTAGTG AAAGCGGCCT ATGCTCCAGA 
AGGGCAGCCG ACAGGTACAT AGAGCAGGGA AATGTATACA TCAACGGCAA AAAAGCCAAG
GTAGGTGATA AAGTGTTTTT TGGGGATATT GTAACGGTTA ACGGACAAAC TATAGACCCT
AAGGAAGTTG AAAATTCTGT TTTAATTGCA TATAACAAGC CTGTTGGCAT TACCAGTACC
ACCGAAGCCG GGGTAAAAGG AAATATTGTA GATCATGTTA ACCATAGCGA ACGCGTGTTC
CCCATTGGCC GTTTGGATAA AGATTCACAG GGACTGATAT TTCTGACCAA CAATGGCGAC
CTGGTAAATA AAATACTCCG GGCAGGTAAC AACCACGAAA AGGAATACGT AGTAACCGTA
AACAAACCCC TTACAGATCA GTTTATAACC GGAATGGCTA AAGGTGTTCC TGTTTTGGGT
GTAATGACCA AAAAATGTAA GGTAGTTAAA GAAAGCCCGC TCATATTCAA AATCACATTG
ATCCAGGGCC TGAACAGGCA GATCAGGAGA ATGTGTGAGT ATTTTGGCTA CGAAGTAACC
AAGCTGGAGC GTGTAAGGAT CATGAACATT CCATTAAAAG GCATTCCTTT AGGAGAATGG
CGTGAGCTCA CACCAGAAGA ACTGACTGGA ATATTTAATA TGGTTGCCAA ATCAAGTGCG
GAAGCAGATG CAACACCCAA AAGAGTAAGC AAAAAACCTA AGACACCAAA AGAGGAGGAT
TTTTTAGAAA ATGTGACCCC TGGGAGATCT TCAGGAAGAA ATGCTAAACC CAAAAGTAAA
GGAAAAGCTG CCCCAGCTTC AGTAAGACGG GAAGATCGAC CAGCTGGAGG CAAGGGCAAA
TCCGGCCCTA AAACTTCCGC TGCTTTTAAA ACAACAGGGG CGGCAGGCGA CTGGAACAAA
AGTGGCGGTC CGTCCAAAAG ACCCGTCAAA CCAACAAAGG GCCGCTCCGG GGCCCCTGCT
GCCAAAACGA GAAGCCCTAA ACGATAG
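
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as a string of A/C/G/T characters in which the spaces and line breaks of the 10-base block layout should be ignored; `gc_percent` is a hypothetical helper name, not something defined by the source database.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) from the block layout is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustration on a toy sequence, not on the gene itself.
print(gc_percent("ATGC ATGC"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the reported GC content of 45%.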
 
Protein sequence
MSDSTTRLNK YISESGLCSR RAADRYIEQG NVYINGKKAK VGDKVFFGDI VTVNGQTIDP 
KEVENSVLIA YNKPVGITST TEAGVKGNIV DHVNHSERVF PIGRLDKDSQ GLIFLTNNGD
LVNKILRAGN NHEKEYVVTV NKPLTDQFIT GMAKGVPVLG VMTKKCKVVK ESPLIFKITL
IQGLNRQIRR MCEYFGYEVT KLERVRIMNI PLKGIPLGEW RELTPEELTG IFNMVAKSSA
EADATPKRVS KKPKTPKEED FLENVTPGRS SGRNAKPKSK GKAAPASVRR EDRPAGGKGK
SGPKTSAAFK TTGAAGDWNK SGGPSKRPVK PTKGRSGAPA AKTRSPKR
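
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: it builds the standard codon table, which shares its codon-to-amino-acid assignments with translation table 11 (the two differ in which start codons are permitted), and translates in frame from the first base until a stop codon. The name `translate` and the table constants are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace from the 10-base block layout is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above.
print(translate("ATGTCTGATT CTACCACC"))  # MSDSTT
# Final 27 bases of the gene sequence above, ending in the stop codon TAG.
print(translate("GCCAAAACGA GAAGCCCTAA ACGATAG"))  # AKTRSPKR
```

Both fragments match the corresponding ends of the protein sequence listed above.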