Gene Sde_1899 details

Gene Information

Locus tag: Sde_1899
Symbol:
ID: 3966963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2399867
End bp: 2401054
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 47%
IMG OID: 637920984
Product: hypothetical protein
Protein accession: YP_527371
Protein GI: 90021544
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
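
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2399867, 2401054)
    print(length, "bp")                   # compare with the Gene Length field
    print(protein_length(length), "aa")   # compare with the Protein Length field
```

With the values from this record, the sketch gives 1188 bp and 395 aa, which matches the Gene Length and Protein Length fields.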


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.598311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATTC ACCTGTTTTA CACCTATCAT TTAAAGGCAG TTTTTAAAGA TAGGCGAAAC 
ATGAACGTTC AACAGTACAG CATAAACACG CTACCACTAG AACCGACCCA AGAAACCGAT
TTAGTGTTCA TTTTTTCAGG TGAGCACCGC GATGAGTTTG CTGCAAACTA TGAGCGTGCA
CGCACAGCCT TCCCTTGCGC CACGATTGTA GGCTGTTCGA CAGCAGGCGA AATAGAAGAC
ACCAGGGTAA CAAGCAATCA TTGCGTAATA ACTGCGGTTA CCTTCAAGAA TACGTCAACA
TTCAGCGTGT GCGCCGAAGT CACAGATATA GCCCACTCCA AACGTGCAGG AGCACAATTA
GCTCAAAAGC TTCTCGCCCA CAGAAACACA CCGCTAAAAC ATATTTTTGT GCTCTCAGAC
GGCTTGAATG TCAATGGCAG CGAACTGGTT ACCGGCCTAG TTGAATCTGT ACCCAGCTAC
ATATCCGTTA CCGGTGGCCT TGCAGGCGAC AATGGTGACT TCACTTTTAC CCAAACCATC
GCAAACGGCG CCCCCAAAAC CGGCCAAGTG GTTGCCGTAG GGTTTTATGG CGACGGCATT
CATGTTGGTC ACGGTTGTAT GGGCGGCTGG GACCCCTTTG GCACCGAGCG GCTAATAACA
CGTTCACAAG CTAACGTACT GTACGAACTA GACGGCAAAC CTGCACTACA ATTATACAAG
CAACACCTTG GCGAACACGC CCAAAGCCTG CCTAGCGCTG GCCTCCTGTT TCCACTAACC
ATTCGTGAAG AAGGATCGAA TACAACACTT GTTAGAACAA TTTTAGGCGT AGATGAAAAA
GACCAAAGTG TGACGTTTGC CGGCAATGTC CCCACTGGGC AGTATGCCCG TTTTATGAAA
GCCAACTTCT CTCGCTTAGT TTCTGGCGCC GAGAATGCCG CTAACGAAAG TGTTAAACAC
TTTAATTCGG ATGAAGCAGA GCTAGCCGTT TTAGTAAGCT GTGTGGGTAG AAAAATGATT
CTTAAGCAAC GGACAGAAGA AGAAGTAGAA GCTGTAAGAA GCGTTTTAGG CCCGCAGGTT
GCTGTAACAG GTTTCTATTC ATACGGTGAA ATATCTCCCC ATACACCTAC GGCAAAATGC
GAGCTGCACA ATCAAACAAT GACTATTACA ACGTTTTGGG AAGAATAA
 
Protein sequence
MAIHLFYTYH LKAVFKDRRN MNVQQYSINT LPLEPTQETD LVFIFSGEHR DEFAANYERA 
RTAFPCATIV GCSTAGEIED TRVTSNHCVI TAVTFKNTST FSVCAEVTDI AHSKRAGAQL
AQKLLAHRNT PLKHIFVLSD GLNVNGSELV TGLVESVPSY ISVTGGLAGD NGDFTFTQTI
ANGAPKTGQV VAVGFYGDGI HVGHGCMGGW DPFGTERLIT RSQANVLYEL DGKPALQLYK
QHLGEHAQSL PSAGLLFPLT IREEGSNTTL VRTILGVDEK DQSVTFAGNV PTGQYARFMK
ANFSRLVSGA ENAANESVKH FNSDEAELAV LVSCVGRKMI LKQRTEEEVE AVRSVLGPQV
AVTGFYSYGE ISPHTPTAKC ELHNQTMTIT TFWEE