Gene Sde_2308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2308 
Symbol 
ID3968138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2932444 
End bp2933979 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content45% 
IMG OID637921399 
Producthypothetical protein 
Protein accessionYP_527780 
Protein GI90021953 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00819182 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCGAA TCCCCAAGGC TTGGCTGGCA CTTCCACTTG TACTGGGAAG TACCAATCTA 
TACGCTCAAG TAACTTGCAG TATCTCTAAC ACCAATGTTT GGAATAACGG ATACACCGTT
AATGTTAATG TAACCAACAC AGGCTCTTCA CAGGTTGGTT CTTGGCAGGT TCCTATTAAT
TTTTCTGAGC CACCTCAAGT AAGCAGCGGC TGGAATGCAA TATTAAGCAC AAACGGAAAC
ACCGTAACTG CCGGCAATAT TGGTTGGAAT GGTAATTTAA ATCCCGGCCA AAGCGCCTCC
TTTGGTTTTC AAGGTGGCCA CGATGGCAGC TTTGTGGAGC CCACCTGCTC GGGCGGAGGC
TCTAGCACTA GCTCAAGCAG CTCTAGTAGT TCTAGCTCAA CAAGTTCTAC CAGTTCTTCA
TCCACAAGTT CAAGTAGCTC TTCTAGCTCC GGCGGCTCTG AACTTTTAAT CCAAGAAAAT
GCATCCGGCT TCTGCCGTGT GGACGGATCG ATAGATAACA ATAACTCAGG CTATACCGGT
AGTGGCTTTG CCAACACCGA GAACCAAAAC GGTTCCGCAG TTGAATACGC ACTTAACGTT
CCCTCTAATG GGAATTATCT CCTCGACGCT CGATATGCAA GCGCTACTAC ACGATCGGCT
AGCGTGGTAG TTAATGGATC TTCAGTAGGC AGCTTTAGTT TTCCATCTAC GGGTTCGTGG
ACAAGCTGGA CAGTTGACTC CGCCAACGTT CCGTTAAAAG GCGGGAATAA TATTGTTCGA
ATTGTTGCAA CTAACAGCAG CGGATTACCT AATATTGATT CATTAAAGGT AATAGGCACC
AACCCGTCAG CCGGCAGTTG TTCAAGCAAC TCGTCATCCA CTAGTTCATC GTCTAGCTCA
AGTTCATCAA GCAGTAACTC CGGTGGCAAA GGCTCTAGCT GCCGTTCTAC AGGCAGTCAA
TCTGTTTCCT CTACTATTAA AGTTACTAGC GGGACTTTCG ATGGGAACTG TAAAACGTAT
AACCCTACAA GTGCCCTTGG CGATGGCAGT CAATCAGAAA GCCAGAAACC GGCATTCCGA
GTGGAGAACG GCGCAACACT CAAAAACGTG ATTCTAGGCA ACAATGGCGT AGACGGTATT
CATGTTTATA ACGGCGGCAC CTTGGATAAC ATCCGCTGGA CCAATGTGGG TGAAGATGCA
ATGACCGTTA AATCTGAAGG AAACGTTACC GTTTCAAATA TTGAGGGTTA TGACGGTTCA
GATAAATTTA TACAAGTAAA CGCAGTTACC AACCTAAAGG TTTCTAATTG CATTGTAGAT
AAAATGGGTA AATTTTTACG TCAGAATGGC GGTAAAACTT TCGCTATGTC TGTAACCGTA
GATAATTGTG ATATCTCAAA TATGGGTGAA GGTGTTTTCC GCTCAGACAG CCCAAATGCA
ACAGCGAGAA TCACAAATAG CCGATTAAAA AATGCAGGCG ACATTTGTAT TGGTAAGTGG
AAAAGCTGCA CATCTTCCAA CATTACCAGC TTCTAA
 
Protein sequence
MLRIPKAWLA LPLVLGSTNL YAQVTCSISN TNVWNNGYTV NVNVTNTGSS QVGSWQVPIN 
FSEPPQVSSG WNAILSTNGN TVTAGNIGWN GNLNPGQSAS FGFQGGHDGS FVEPTCSGGG
SSTSSSSSSS SSSTSSTSSS STSSSSSSSS GGSELLIQEN ASGFCRVDGS IDNNNSGYTG
SGFANTENQN GSAVEYALNV PSNGNYLLDA RYASATTRSA SVVVNGSSVG SFSFPSTGSW
TSWTVDSANV PLKGGNNIVR IVATNSSGLP NIDSLKVIGT NPSAGSCSSN SSSTSSSSSS
SSSSSNSGGK GSSCRSTGSQ SVSSTIKVTS GTFDGNCKTY NPTSALGDGS QSESQKPAFR
VENGATLKNV ILGNNGVDGI HVYNGGTLDN IRWTNVGEDA MTVKSEGNVT VSNIEGYDGS
DKFIQVNAVT NLKVSNCIVD KMGKFLRQNG GKTFAMSVTV DNCDISNMGE GVFRSDSPNA
TARITNSRLK NAGDICIGKW KSCTSSNITS F