Gene Sde_3238 details


Gene Information

Locus tag: Sde_3238
Symbol:
ID: 3965711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4128320
End bp: 4130278
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 48%
IMG OID: 637922335
Product: hypothetical protein
Protein accession: YP_528707
Protein GI: 90022880
COG category:
COG ID:
TIGRFAM ID:
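
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4128320
end_bp = 4130278

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1959
print(protein_length)  # 652
```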


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000359216
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.364609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAACC TATCAACCAA GCGTTTACTG CCATTAGCAG CGCTCGCAAG TGCAATTTCT 
TTCCATACGC ACGCTGCAGA CTTACACGTA TATAGCGGCG AAGATTTACA AGCAGCCATT
CAAGCTGCAC GAGCCGGCGA CGAAATTATC CTCCACCCAG GCGGTTACGA AAGTGTGGCC
ACATCCACCC TTAGCGGCTT TTCCGATGCG CATTACTACG GCGCCGCAGA CGGTACCGCA
AGCCAACCCA TTATTTTGCG CAGCCAGTCG GCCTCGAACA AACAAGTACT AACCGGTGCA
ACCCCAGATA AAAAAATTGG TTTATACATC ACCGGCGATT ACTGGATTAT CAAAGATATT
GTTGTAACAA ACTCTAAGCG CGGCATTGTG TTAGATAACT CTAACCACTC CATTATTGAT
AACGTGGTTG TACACAACAT AGGGCAAGAA GGTATTCACC TGCGCGATAA CAGCTCGTAC
AACATTGTTA AAAACTCTTG GGTTTACAAC ACAGGCGTTA CCAGCGCGGG CACGGGCGAA
GGCTTTTACG TTGGCTCGCA CCCTGGCAGC GATAACGGCA AAGGCTATGT GTACGGCGCG
GCATGTAATT ACAACGTAAT CGGCGGCAAC ACCATTGGGC CAAACGTACG TGCAGAGCAT
ATAGACATAA AAGAAGGCAC CAAAGGCACC ATATTCGAAT ACAACATTAT GGATGGAACG
GGTATTTCTG GTGCAAACTC GGCAGACAGC TTTGTAGACA TTAAAGGCCA AGACGCGGTT
ATTCGTGGCA ACAAAGCCTA CAGAAACGGC GAAGCAAAAG TTAAAAACGC ATTCGAAACT
CACAGCGATG CCGACAACAA CACGTTTATT GCAAACGAAG TGGACCTAGA CGGTAGCAGC
GATTACTTAC TGTTTGCTTC AGAAGGCGTA AACCACGTTT CTAGCGATAA CATACGTTTA
GATGGTCGCA GCGATCGTAT TGCCGGCCAT TGGTCCGCTG CAAGTGTCGA TTCGAATATC
CCTAGCGGTA TTCCTGCACC GGGCAACTAC AGCACTAGCT CCGGTTCGTC TTCCAGCTCA
TCGTCTAGTT CTTCTAGTTC GTCTTCAAGT TCATCCTCTA GCTCTTCAGG CTCCAGCTCT
GGCGGCGTAA ATTTACCCGC AAAAATTGAA GCAGAAAATT ACGATAGCGC ACCGGTAGAA
AGCACAAGCG GCAACGCAAA CAGCGGCAGC GTAACCAACT GCACCTACCG AGGCCTAAAT
GTAGATGTGC AAAACGCCAC AGAAAGCGGC TGCAATATTG GCTGGACAAC AGCCGGTGAA
GAAGTCACTT ACAACATTGG CAGCGCCAAC GCCAACTACA ATGTTGCTCT GCGAATCGCT
TCCAACTATT CCGGTAAGCG CTTAGCACTA TACGTTAACA ATACACACGC TGGCACAGTC
ACAACCAGTG GCAGCGGTTG GCAGGCATGG GAAACCAAAA CGCTTTCTAA CGTTTATATT
CCATCTAACA GTGTTTTAAC GGTTAAGTTT TTAGATGGCA GCACCAACTT TAATTACTTA
AACATAACAA CAGGCGGCAG CTCCTCTTCT AGCTCCAGCT CTAGCTCAAG TTCATCGTCA
AGCTCGTCTT CAAGCTCGTC ATCTAGCTCT AGCTCCTCCG GCGGCGGCGC ATGCTCAAGT
TATGTAGACA TTCCTTGGGA TGAGCGAACA GAGGTCACAC TGGGCAATGG TGTATGTGTA
CGCACAGCAC AAAATCTTGC TGGCAAAACC TTGCAACTAT GGGATAGCGA TACCAATAGC
TCGTGCGACT TCCGTGGCAC CGTTGTATCT ACAGATGGCA CCGGCAGTGT TAGCGTTAGT
AGTAACTACG TATCCACAAC AAATTTAACT GGCACCAAGC TTAATTTTGT ACCCGCCAGC
GGAACCAGTT GCCAATATGT AAAAGTACGT TCGTATTAA
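
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any length or composition check. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence in this record, used as a small example.
first_line = "ATGCGAAACC TATCAACCAA GCGTTTACTG CCATTAGCAG CGCTCGCAAG TGCAATTTCT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

Applying the same two helpers to the full sequence text would allow a comparison against the Gene Length and GC content fields of the record.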
 
Protein sequence
MRNLSTKRLL PLAALASAIS FHTHAADLHV YSGEDLQAAI QAARAGDEII LHPGGYESVA 
TSTLSGFSDA HYYGAADGTA SQPIILRSQS ASNKQVLTGA TPDKKIGLYI TGDYWIIKDI
VVTNSKRGIV LDNSNHSIID NVVVHNIGQE GIHLRDNSSY NIVKNSWVYN TGVTSAGTGE
GFYVGSHPGS DNGKGYVYGA ACNYNVIGGN TIGPNVRAEH IDIKEGTKGT IFEYNIMDGT
GISGANSADS FVDIKGQDAV IRGNKAYRNG EAKVKNAFET HSDADNNTFI ANEVDLDGSS
DYLLFASEGV NHVSSDNIRL DGRSDRIAGH WSAASVDSNI PSGIPAPGNY STSSGSSSSS
SSSSSSSSSS SSSSSSGSSS GGVNLPAKIE AENYDSAPVE STSGNANSGS VTNCTYRGLN
VDVQNATESG CNIGWTTAGE EVTYNIGSAN ANYNVALRIA SNYSGKRLAL YVNNTHAGTV
TTSGSGWQAW ETKTLSNVYI PSNSVLTVKF LDGSTNFNYL NITTGGSSSS SSSSSSSSSS
SSSSSSSSSS SSSGGGACSS YVDIPWDERT EVTLGNGVCV RTAQNLAGKT LQLWDSDTNS
SCDFRGTVVS TDGTGSVSVS SNYVSTTNLT GTKLNFVPAS GTSCQYVKVR SY