Gene Sde_2937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2937 
Symbol 
ID3968027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3730327 
End bp3731508 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content47% 
IMG OID637922034 
Producthypothetical protein 
Protein accessionYP_528406 
Protein GI90022579 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.807989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCG AAGACGCTAG TCGTTTTGAA GAGTTAGTGC GTTCGCCTAT TCGCAGCGAA 
GAGCGCGAGA GTGCAATAAA GCAGTGGCAT ACATACAAAG CTGGCAAACA AATGGCGTTT
GTTCAGCGTG GCAATGCGGG TGAAGGTGGC GCCAATAATG GTAAGGTTGG TTTGTGGAGT
AAGGGCGATT ACGTTCGCTC ATTAACTGTG CCAGCTATAG ATTTATGCGA TGCAACGTTT
GAGGATGTGT GCTTAGGATA TGTAGATATG CGGGGCGCGA AGATGGACCG CGTTCGGTTT
ACCCAAAACT ACCTTACTTG GTCTGCAATG AAGGGGGCGC AATTAGAATG CGCTAGCCTA
TGTGATGCAG ATTTACCCCA TATACGGTTG TTAGACGCAA ATTTAATGGG CGTAAACCTG
AGTGGTGCTA ATTTGCAGGG GGCGGATTTT TCTCGAGCGA ACTTGGCCGG GGTTAACCTA
AAGGGCGCTG ATTTGCGTGG TGCTATATTA GAGTATGCAA ATATGGTGGG CGCCAATGTA
GAAGGCGCAA AGCTGGATGG CGCACATGTA TATGGTGTTT CTGCGTGGGA TTTACGCGGC
CAGCCGGCTT CGTCAACCGA CTTAATAGTA ACCCCTCAGC ACACACCTAG TATTACTACC
GATAATATTC GCGTAGCGCA GTTTATTTAC CTGCTAGTAA ACAACCCAGA AATTCGCGAT
GTGCTAGATA CGGTTACACG TAAGGTTGTT TTGCTACTGG GGCGATTTAA GCCCGAGCGC
AAAGCGGTGC TAGATGCACT AAAAGTAGAG CTTCGCAGCC GCAATCTTGT GCCAGTCATT
TTTGATTTCG ACCAAGCTGA AGCCCGAGAC ATTACAGAAA CTATTACCCT GCTTGCTAGA
ATATCTAGGT TTGTAATTGC AGATCTAACA GAGCCCTCAA GTATCCCTCA AGAGCTGCAA
GCTATTGCGC CCGATGTAGC AGTGCCCATT CGGCTAATTA TTCAAGAGCC GCACGCGCCT
TATGCCATGG CAAAAGATTT AAAAAAATAC CCCTGGGTGA TAAAACCCTT CAAATACACT
GATATTCACC ATCTATTGGC ACAGCTAGAT GTCGACATAA TACATGTTGC AAATGCTGTT
GCCGAAAGTA TTAATCAAAT GCGTGCAGAG GATGATTGGT AG
 
Protein sequence
MNTEDASRFE ELVRSPIRSE ERESAIKQWH TYKAGKQMAF VQRGNAGEGG ANNGKVGLWS 
KGDYVRSLTV PAIDLCDATF EDVCLGYVDM RGAKMDRVRF TQNYLTWSAM KGAQLECASL
CDADLPHIRL LDANLMGVNL SGANLQGADF SRANLAGVNL KGADLRGAIL EYANMVGANV
EGAKLDGAHV YGVSAWDLRG QPASSTDLIV TPQHTPSITT DNIRVAQFIY LLVNNPEIRD
VLDTVTRKVV LLLGRFKPER KAVLDALKVE LRSRNLVPVI FDFDQAEARD ITETITLLAR
ISRFVIADLT EPSSIPQELQ AIAPDVAVPI RLIIQEPHAP YAMAKDLKKY PWVIKPFKYT
DIHHLLAQLD VDIIHVANAV AESINQMRAE DDW