Gene Sde_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3603 
Symbol 
ID3966465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4563392 
End bp4564777 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content50% 
IMG OID637922700 
ProductTonB-like 
Protein accessionYP_529070 
Protein GI90023243 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.70293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00573623 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACCT TTAACCCAGA TTTCGTATGG GGAGCAGCCA GTTCCGCCTA TCAGGTAGAA 
GGCGCCACCA CCACCGATGG CAGAGGCCCC AGTATTTGGG ATGCGTTCAG TTCCATTCCC
GGTAAAACCT ACCACAACCA AAACGCCGAC ATAGCCTGCG ACCACTACAA CCGCTGGCAA
GAAGACGTGG CCATAATGAA AGAGATGGGG CTAAAGGCTT ACCGCTTTTC TATTTCTTGG
TCGCGCATAT TCCCTACTGG GCGCGGCGAA GTTAACGAAA AAGGCGTAGC CTTTTACAAC
AACCTTATCG ACGAATTAAT AAAAAACGAC ATTACCCCTT GGGTAACCCT ATTTCACTGG
GACTTTCCTC TGGCACTGCA AATGGAAATG GACGGCCTAC TTAACCCCGC CATCGCCGAC
GAATTCGCCA ACTACGCCAA GCTGTGTTTC GCGCGCTTTG GCGACCGCGT TACCCACTGG
ATTACCCTAA ACGAACCTTG GTGCAGTGCC ATGCTTGGCC ACGGCATGGG CAGCAAAGCC
CCTGGCCGCG TATCTAAGGA TGAACCCTAT ATAGCCGCCC ACAACTTGCT GCGTGCACAC
GGCAAAATGG TAGATATTTA CCGGCGCGAA TTTCAGCCCA CACAAAAAGG CATGATAGGC
ATAGCCAACA ATTGCGACTG GCGCGAACCC AAAACCGATT CTGAATTAGA TAAAAAAGCA
GCCGAGCGCG CCCTAGAATT TTTTGTAAGC TGGTTTGCCG ACCCCATTTA TTTGGGCGAC
TACCCAGCCA GCATGCGCGA GCGCTTGGGT GAGCGTTTAC CCACCTTTAG CGACGAAGAC
ATTGCGCTAA TAAAAAACTC TAGCGACTTT TTTGGTTTGA ATCACTACAC CACCATGCTT
GCCGAACAAA CCCACGAAGG TGACGTTGTT GAAGATACTA TTCGCGGCAA CGGCGGCATA
TCGGAAGACC AAATGGTCAC CCTCTCCAAA GACCCAAGCT GGGAACAAAC CGACATGGAG
TGGAGCATTG TGCCCTGGGG CTGTAAAAAA TTATTAATCT GGTTAAGCGA GCGCTACAAC
TACCCCGACA TTTACATTAC CGAAAACGGC TGCGCCCTAC CCGACGAAGA CGACGTAAAC
ATAGCCATTA ACGATACACG CCGCGTAGAT TTTTACCGCG GTTATATCGA TGCGTGTCAC
CAAGCAATAG AGGCCGGCGT AAAACTAAAA GGCTATTTTG CATGGACACT TATGGATAAC
TACGAATGGG AAGAAGGCTA CACCAAACGC TTTGGCTTAA ACCATGTAGA TTTCACCACA
GGCAAACGCA CACCTAAACA GTCTGCAATT TGGTATAGCA CGTTAATTAA AGATGGTGGG
TTCTAG
 
Protein sequence
MKTFNPDFVW GAASSAYQVE GATTTDGRGP SIWDAFSSIP GKTYHNQNAD IACDHYNRWQ 
EDVAIMKEMG LKAYRFSISW SRIFPTGRGE VNEKGVAFYN NLIDELIKND ITPWVTLFHW
DFPLALQMEM DGLLNPAIAD EFANYAKLCF ARFGDRVTHW ITLNEPWCSA MLGHGMGSKA
PGRVSKDEPY IAAHNLLRAH GKMVDIYRRE FQPTQKGMIG IANNCDWREP KTDSELDKKA
AERALEFFVS WFADPIYLGD YPASMRERLG ERLPTFSDED IALIKNSSDF FGLNHYTTML
AEQTHEGDVV EDTIRGNGGI SEDQMVTLSK DPSWEQTDME WSIVPWGCKK LLIWLSERYN
YPDIYITENG CALPDEDDVN IAINDTRRVD FYRGYIDACH QAIEAGVKLK GYFAWTLMDN
YEWEEGYTKR FGLNHVDFTT GKRTPKQSAI WYSTLIKDGG F