Gene Sde_0906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0906 
Symbol 
ID3965542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1168671 
End bp1171037 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content49% 
IMG OID637919968 
ProductNdvB protein 
Protein accessionYP_526380 
Protein GI90020553 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000016442 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTAAAAG CCATTAACAA CGGCGAACGC TATCAACTCA CTAGCCCTAC CGCTATGCCG 
CAAAGCGCAT CGTTTTTATG GAATAAAAAA ATGATGATAC AAGTAAATTG CCGCGGCTAC
GCCGTTGCGC AATTTATGCA GCCAGAACCA GCCAAATACG CTTACGCACC CAATCTGGAA
GCAAAAACAT TTATGCAACC AGAGCAACCC TATTACGCGC ATCACCCCGG GCGCTTTTTC
TATATAAAAG ATGAAGAGAC AGGCGAGATT TTTTCGGCAC CCTACGAGCC TGTGCGCAGC
CAGCTGAACA ACTTTAGCTT TAACGCAGGC AAGAGCGATA TAAGCTGGCA TATTGCCGCT
TTAGGCATTG AAGTAGAGCT ATGTCTTAGC CTGCCGGTGG ACGATGTAGT AGAATTGTGG
GAACTAAAAA TAAAAAACGG CGGCGCGCAA CCTCGTAAAC TCAGTATTTA CCCGTACTTT
CCTGTGGGTT ACATGTCGTG GATGAATCAA TCTGGTGACT ACAGCCAAAC CGCCGGCGGC
ATTATTGCCA GCTGCGTAAC GCCTTATCAA AAAGTCGCCG ACTACTTTAA GAATAAAGAC
TTTAAAGATA AAACGTTCTT TCTTCACGAA ACCGCCCCAG CAGCATGGGA AGTAAACCAG
AAAAACTTCG AAGGCGAAGG CGGGTTGCAC AACCCCAACG CCATACAACA AGAAACGCTG
GGCTGCGGCA ACGCATTGTA CGAAACGCCC ACAGCGGTAT TGCAATACCG CCGCGAACTT
GCAGCGCAAG AGCAGCAAAC CTTTCGCTTT ATTTTTGGCC CAGCATTTGA CGAGAGCGAA
GCCATTGCAC TGCGCAATAA GTATTTATCT GCCGAAGGTT TTGCCAAAGC AAAAAGCGAA
TACCAAACCT ATATAACGAG CGGCAAAGGC TGCTTGCAAA TTAACACCCC AGACCCAGAA
CTAAACAACT TTGTAAACCA CTGGCTACCG CGCCAAGTGT TTTATCACGG CGATGTAAAC
CGGTTAACCA CCGACCCGCA AACGCGCAAT TATATTCAAG ACAATATGGG CATGAGCTAC
ATTAAGCCCA ACATTACGCG GCAGGCGTTT TTACATGCCT TAAGCCAGCA GGAAGAAAGC
GGTGCAATGC CCGACGGCAT TTTATTGCTT GAAGGCGCCG AGCTTAAATA CATAAACCAA
ATACCCCATA CCGATCACTG CGTTTGGCTG CCGGTGTGTA TGCAAGCCTA TTTGGATGAA
ACCAATGACT ACGCCCTATT AGACGAAATA GTACCCTATG CGAGTGGCGA GAAGCGCGAA
ACTGTTGAGC AACATATGCA TCACGCTATG CGCTGGCTTT TGCAAGCACG CGACGAACGC
GGCCTAAGCT TTATCGCACA GGGCGACTGG TGCGACCCCA TGAACATGGT GGGCTACAAG
GGCAAAGGGG TATCCGGCTG GCTTTCAGTC GCTACCGCTT ATGCATTAAA CCTGTGGGCA
GATGTTTGCG AACAACGGCA GCAAAACAGT TGCGCCAACG AATTTAGACA GGGCGCTAAA
GATATAAACG CGGCGGTAAA CAAGCATATT TGGGATGGCG AATGGTTTGG CCGCGGCATT
ACAGATGACG GCGTACTGTT TGGCACCAGC AAAGATAAAG AAGGCAGAAT TTTTCTAAAC
CCACAAAGCT GGGCAATACT TGGCGGCGCC GCCGACGAAC AAAAAATCCC ATGCCTGCTA
GACGCAGTAG AGCAACAACT GGAAACCCCT TACGGCGTAA TGATGCTGGC CCCCGCGTTT
ACCGCCATGC GCGATGACGT AGGCCGAGTT ACCCAAAAAT TCCCAGGCTC TGCAGAAAAC
GGCTCTGTTT ATAATCACGC GGCGGTGTTT TATATATTTA GCTTGTTATC CATTGGCGAG
AGCGAACGCG CATATAAACT GCTACGCCAA ATGCTGCCTG GGCCAGATGA AGCCGATCTT
TTACAGCGCG GCCAACTGCC AGTATTCATA CCTAACTATT ATCGCGGCGC ATACTACCAG
CACCCCCGCA CCGCCGGTCG CTCTAGCCAG CTCTTTAATA CGGGTACAGT CTCGTGGGTT
TACCGCTGCT TAATTGAAGG GGTATTCGGC TTGAAAGGCT CGCCACAAGG CTTAGTTGTA
CAACCGCAAC TGCCTGTCGC CTGGCAAACA GCAGAAGCCG TTAGGGAATT TAGAGGCGCA
ACGTTTAACG TGAGCTACCG CAAAAGCAGC GATATAAAAG AAATGGAAAT ACAGCTAAAT
GAATCGGTAA TAAGTGGCAA CACCATCTCC GACATCACCG CCGGCGCGAC CTATCAATTA
ACCGTTCTAT TACCTGCCAC ACACTAA
 
Protein sequence
MLKAINNGER YQLTSPTAMP QSASFLWNKK MMIQVNCRGY AVAQFMQPEP AKYAYAPNLE 
AKTFMQPEQP YYAHHPGRFF YIKDEETGEI FSAPYEPVRS QLNNFSFNAG KSDISWHIAA
LGIEVELCLS LPVDDVVELW ELKIKNGGAQ PRKLSIYPYF PVGYMSWMNQ SGDYSQTAGG
IIASCVTPYQ KVADYFKNKD FKDKTFFLHE TAPAAWEVNQ KNFEGEGGLH NPNAIQQETL
GCGNALYETP TAVLQYRREL AAQEQQTFRF IFGPAFDESE AIALRNKYLS AEGFAKAKSE
YQTYITSGKG CLQINTPDPE LNNFVNHWLP RQVFYHGDVN RLTTDPQTRN YIQDNMGMSY
IKPNITRQAF LHALSQQEES GAMPDGILLL EGAELKYINQ IPHTDHCVWL PVCMQAYLDE
TNDYALLDEI VPYASGEKRE TVEQHMHHAM RWLLQARDER GLSFIAQGDW CDPMNMVGYK
GKGVSGWLSV ATAYALNLWA DVCEQRQQNS CANEFRQGAK DINAAVNKHI WDGEWFGRGI
TDDGVLFGTS KDKEGRIFLN PQSWAILGGA ADEQKIPCLL DAVEQQLETP YGVMMLAPAF
TAMRDDVGRV TQKFPGSAEN GSVYNHAAVF YIFSLLSIGE SERAYKLLRQ MLPGPDEADL
LQRGQLPVFI PNYYRGAYYQ HPRTAGRSSQ LFNTGTVSWV YRCLIEGVFG LKGSPQGLVV
QPQLPVAWQT AEAVREFRGA TFNVSYRKSS DIKEMEIQLN ESVISGNTIS DITAGATYQL
TVLLPATH