Gene Sde_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1249 
Symbol 
ID3968210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1599354 
End bp1600343 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content49% 
IMG OID637920323 
Productpseudouridylate synthase 
Protein accessionYP_526723 
Protein GI90020896 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.883014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.601505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTA ATTTAGATTT TGCCTACGCC CAAGGGCAGC CCACCCAGAC TGCCACCTTC 
CGCCAATTAG CGGAGGATTT TATTGTAGAT GAGCAGCTAG GCTTTGAATT TAGCGGTGAA
GGTGAACACC TATATGTTCA AATTAAAAAA ACAGGCGAAA ACACGCAATA CGTTGCTAAA
CAACTGGCGC GCTACTTGGG GGTTAAGCCG GTTGCGGTTG GCTTTAGCGG CCTGAAGGAT
CGTCACGCTG TAACTACCCA GTGGTTTAGC GTGCAATTGC CGGGTAAAAA TATTGATATC
GACTGGGCTG ACTTTATCGA AAAAACGCAG CTCAATGTGG AAGTATTACA GCAAGGGCGG
CACAGCGCCA AATTGCGCCG CGGTCAGCAT TTATGTAACG ATTTTGTTAT TACTTTGCGC
GATATTAGCC AAAGTGATGA CCTAGAAGCG CGATTGCAAA CTGTAGCGGC CAACGGTGCC
CCCAATTATT TCGGTGAGCA GCGTTTTGGT ATTGATGGCG GTAACCTTGC GCGTGCGCAG
GCGTGGTTTA GTGGTGAAGA CCCCATTCGC AACAAAAACA TGCAGGGTAT TATTTTATCT
GCTGCACGCT CCTACCTGTT TAACCTAGTG CTTAGCGAGC GAATAAAGCA AGACAATTGG
CTAGCGCCGA TGGACGGCGA CCCCGCAGAA GTACCAACTG GCCCACTATG GGGCCGCGGC
CGACCTAAAT CGACCGATGC ATTGCTAGAG CTAGAAAACG AAGTACTGGC CCACTTGGAC
CTATGGCGAG ATAAGCTCGA GCACAACGGT TTAAGCCAAG AGCGCAGAGA TTTAGTGCTT
AAGCCTCGTT CATTCTCGTG GCAGTGGCAA GACAATGCCT TGGTACTTAG CTTGTCGTTG
GCCCCTGGGC TATATGCAAC ATCGCTGTTG CGTGATGTAT TGCTATTGAA TAACGTTTCA
GCAGAACAAT ACGCCCCTCC TGCAGCCTAA
 
Protein sequence
MSFNLDFAYA QGQPTQTATF RQLAEDFIVD EQLGFEFSGE GEHLYVQIKK TGENTQYVAK 
QLARYLGVKP VAVGFSGLKD RHAVTTQWFS VQLPGKNIDI DWADFIEKTQ LNVEVLQQGR
HSAKLRRGQH LCNDFVITLR DISQSDDLEA RLQTVAANGA PNYFGEQRFG IDGGNLARAQ
AWFSGEDPIR NKNMQGIILS AARSYLFNLV LSERIKQDNW LAPMDGDPAE VPTGPLWGRG
RPKSTDALLE LENEVLAHLD LWRDKLEHNG LSQERRDLVL KPRSFSWQWQ DNALVLSLSL
APGLYATSLL RDVLLLNNVS AEQYAPPAA