Gene Sde_0683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0683 
Symbol 
ID3964934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp869047 
End bp870240 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content45% 
IMG OID637919744 
Productsigma-70 factor 
Protein accessionYP_526157 
Protein GI90020330 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00481968 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000266294 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTATTAG CTCAATTGCC AGTTAAAAAA TATTTTGTCT TATTAGCTAT TTTCTCGTTT 
ATGCTGGGGT GTAATAGTGC TGGCGTACAA CAAAGTGCTA AATCAATTCA GGTTGCTGGC
ACGCACAGTA AACCCGCTCG TTTTTTTGCT GGTGCCGACC TTTCTTACGT AAACGAAATG
GAAGATTGCG GAGCAACATA CCGCGTAAAC GGTGTAACTA CCGACCCTTA CCAAGCCTTT
GCCGATGCCG GCGCAAATTT AGTGCGCGTG CGCTTATGGC ACAACCCTAC TTGGACAGAA
TATTCCGACT TTGCCGACGT TAAAAAAACT ATCCGCAAAG CCAAACAAAA TAATCAAACG
GTATTGTTAG ATTTTCATTA TTCAGATACC TGGGCCGACC CAGAAAAACA ATTTGTTCCA
GCCGCTTGGG AACATATGGT GGATGACACC CCAGCACTAG CGCAAGCCTT AGCGCAATAC
ACAACCGATG TATTAGAAAA GCTGCAAGCA GAAAACCTAT TGCCAGATAT GGTGCAAGTA
GGTAACGAAA CAAACGCAGA AGTCTTACAG CTAGAAGCGC ACATGAAACA CGGCGAAATA
GATTGGCAGC GCAATGCAGC GCTACTAAAC AGTGGGTTAG CAGCCGTTGC TGAATTTAAC
CAAAACAACA ACACCTATAT TGAACGCGTA TTACATATCG CCCAGCCAGA AAATGCTTTG
TGGTGGTTTG ACGATGCCGC GCAGGCTGGC ATAACCGATT TTGAAATTAT AGGTCTTAGC
TACTATGCCA AATGGTCAAC GTATAAATTA GATTCCATCG GCGAAGCTAT ACGCGCCTTG
CGAACCGCAT TCAATAAAGA TGTGTTGGTG GTAGAAACCT CATACCCCTG GACTATGCAA
AATTTCGATC AAGCCAATAA CGTGCTCGAT GCTACCAGCT TGCAGCAGGG CTACCCTGCA
ACGGCCGAAG GCCAAAAAAA ATACATGATG GATTTAGCTA AACAAATTAT GTACGCCGGT
GGAATTGGTA TTGCCTACTG GGAACCAGCT TGGGTAAGCA CCCCTTGCAA AACTCTATGG
GGTACAGGTT CTCACTGGGA AAATGCCGTG TTTTTTGACT CTGGCAACAA CAACGAAGCG
CTACCCGCGC TTAGTTTCTA CACAGACATA ATGGCTCTTT TTAAGCAAGA TTAA
 
Protein sequence
MLLAQLPVKK YFVLLAIFSF MLGCNSAGVQ QSAKSIQVAG THSKPARFFA GADLSYVNEM 
EDCGATYRVN GVTTDPYQAF ADAGANLVRV RLWHNPTWTE YSDFADVKKT IRKAKQNNQT
VLLDFHYSDT WADPEKQFVP AAWEHMVDDT PALAQALAQY TTDVLEKLQA ENLLPDMVQV
GNETNAEVLQ LEAHMKHGEI DWQRNAALLN SGLAAVAEFN QNNNTYIERV LHIAQPENAL
WWFDDAAQAG ITDFEIIGLS YYAKWSTYKL DSIGEAIRAL RTAFNKDVLV VETSYPWTMQ
NFDQANNVLD ATSLQQGYPA TAEGQKKYMM DLAKQIMYAG GIGIAYWEPA WVSTPCKTLW
GTGSHWENAV FFDSGNNNEA LPALSFYTDI MALFKQD