Gene Sde_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0133 
Symbol 
ID3967587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp164554 
End bp166077 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content41% 
IMG OID637919192 
Producthypothetical protein 
Protein accessionYP_525609 
Protein GI90019782 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCGC AACAGTTAAA AGATTTAGCT CAAGCCATTT GGCTTGAGCT CGTTCACTAT 
AAAGAATGGT GTGTAGCTAT TTTTATAGTT GTAAGCCTAG CTGCAATTGG CTTGGGGTAT
ACCTGGCCGC AAAGTTATAA AACTAGTGCT GTACTGAATG CGGATGAGCA AAACATCATC
GCACCACTAC TTAGTGGTCG TGCGGAGGTA ACCAAGGTAG ATCGCTCGCA AGACGCTAAA
GAAGTTATCT ATACGCGTAC ATTTTTAGAG AAAGTAGTTA AAGAAAGTGG CTTGCTCGAA
GGAGAACCAA GCCCAGATGC AATAGAAAGA AAAATTTATG CTTTGCGCAG TACAATAGAA
GTTTCTTCTG TTAATAAAAA TTATTTTAGA GTTAGTTATA CCGCGCCAAG CCCCGATAGA
TCTTTCACTG TTTTAAGTGA AACTGTTAGG CAATTTGTTG AGCACACCGC TAAGCAAAAG
AAGGAAGAAA GTTACTCTGC TTATCAGTTT ATTGATTCGC AAGTGCAAGC TTACAAAAGA
CAGCTTGAAG ATGCCGAAGA AAAACTAAAG ATATTTAAAT CGGGCAATAT AGACGGAAGC
GAAGCATCCG TTTCTAGCCG AATTTCTCAA CTTCGAACAG AAATAGAAAA CTTAAAGCTG
TCTATAGATG AAACAAATTC AAGGTTAAAA ACAGTACAAA GCCAGTTAGA TAACGAATCT
TCATATTTGC AGGCTCGTTC GAAATTGGAT TCATTAGAGC AGCGCAAATC TGGTTTGAAT
ACTGAACTAG AAAATTTGCG TCTGTCTTAT CAAGAAAATT ACCCCGATGT TGTATCTCTA
AAGTTGCAAA TTGCAGAGCT AGACAAACGC ATTGAAGAAA TATACCGCTT AGACGGAGTA
ACTACTTCGG GTAGCAGCAG TACCGAACAA AACCCGCTCT ACGAAGAACT GCGTAAGCAA
ATGGCCGTGG CTGAAGTAGA TCTGCGTACG CAGAAACAAC GCAAAGCTTC TTTAGAGCGA
TTGCTTGAGC AAGAGTATGC CCGAGCAGAG AAAATAGCCG CAAACGAAGC AGAGCTATCG
GAGCTTACGC GTGATTACGA TGTAACTAAG CGCGTTTATG AAGAGATGCT CGATAGAAAA
GAAAAAGCTA GAATGTCTAT GGCGTTAGAT GTGGAAGGCC AAGGTGTAAG CTACAAAATT
CACGAGCCAC CGGTTTATCC GTTGCAGTCT ACGGGCCTAA AGTTTATTCA CTTCGCAATA
ATAGGGCCTA TTCTTGGCTT TATAATACCA ATAGGTCTGG TTATAGCTTT AATTATGCTC
GACCCTCGAA TACGTTCTGG TCTTGCTCTA GCGTCAAAAT TTGAAGATGA AGTTGAAGTA
ATCGGTACTA TTCCTCATAG CTCCACAGCG CTAGCTAAGC GTGTGGTGCG GCGAGATGCA
GTAATAATCA TTGGTGTAAC GCTTTGTTTT CTTGCGCTGT ATGCCGTTAT TGTGGCCAAT
AAACTGTTTA GTTCTACCTT ATAA
 
Protein sequence
MDPQQLKDLA QAIWLELVHY KEWCVAIFIV VSLAAIGLGY TWPQSYKTSA VLNADEQNII 
APLLSGRAEV TKVDRSQDAK EVIYTRTFLE KVVKESGLLE GEPSPDAIER KIYALRSTIE
VSSVNKNYFR VSYTAPSPDR SFTVLSETVR QFVEHTAKQK KEESYSAYQF IDSQVQAYKR
QLEDAEEKLK IFKSGNIDGS EASVSSRISQ LRTEIENLKL SIDETNSRLK TVQSQLDNES
SYLQARSKLD SLEQRKSGLN TELENLRLSY QENYPDVVSL KLQIAELDKR IEEIYRLDGV
TTSGSSSTEQ NPLYEELRKQ MAVAEVDLRT QKQRKASLER LLEQEYARAE KIAANEAELS
ELTRDYDVTK RVYEEMLDRK EKARMSMALD VEGQGVSYKI HEPPVYPLQS TGLKFIHFAI
IGPILGFIIP IGLVIALIML DPRIRSGLAL ASKFEDEVEV IGTIPHSSTA LAKRVVRRDA
VIIIGVTLCF LALYAVIVAN KLFSSTL