Gene Information |
Locus tag | Sde_0683 |
Symbol | |
ID | 3964934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 869047 |
End bp | 870240 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637919744 |
Product | sigma-70 factor |
Protein accession | YP_526157 |
Protein GI | 90020330 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00481968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000266294 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTATTAG CTCAATTGCC AGTTAAAAAA TATTTTGTCT TATTAGCTAT TTTCTCGTTT ATGCTGGGGT GTAATAGTGC TGGCGTACAA CAAAGTGCTA AATCAATTCA GGTTGCTGGC ACGCACAGTA AACCCGCTCG TTTTTTTGCT GGTGCCGACC TTTCTTACGT AAACGAAATG GAAGATTGCG GAGCAACATA CCGCGTAAAC GGTGTAACTA CCGACCCTTA CCAAGCCTTT GCCGATGCCG GCGCAAATTT AGTGCGCGTG CGCTTATGGC ACAACCCTAC TTGGACAGAA TATTCCGACT TTGCCGACGT TAAAAAAACT ATCCGCAAAG CCAAACAAAA TAATCAAACG GTATTGTTAG ATTTTCATTA TTCAGATACC TGGGCCGACC CAGAAAAACA ATTTGTTCCA GCCGCTTGGG AACATATGGT GGATGACACC CCAGCACTAG CGCAAGCCTT AGCGCAATAC ACAACCGATG TATTAGAAAA GCTGCAAGCA GAAAACCTAT TGCCAGATAT GGTGCAAGTA GGTAACGAAA CAAACGCAGA AGTCTTACAG CTAGAAGCGC ACATGAAACA CGGCGAAATA GATTGGCAGC GCAATGCAGC GCTACTAAAC AGTGGGTTAG CAGCCGTTGC TGAATTTAAC CAAAACAACA ACACCTATAT TGAACGCGTA TTACATATCG CCCAGCCAGA AAATGCTTTG TGGTGGTTTG ACGATGCCGC GCAGGCTGGC ATAACCGATT TTGAAATTAT AGGTCTTAGC TACTATGCCA AATGGTCAAC GTATAAATTA GATTCCATCG GCGAAGCTAT ACGCGCCTTG CGAACCGCAT TCAATAAAGA TGTGTTGGTG GTAGAAACCT CATACCCCTG GACTATGCAA AATTTCGATC AAGCCAATAA CGTGCTCGAT GCTACCAGCT TGCAGCAGGG CTACCCTGCA ACGGCCGAAG GCCAAAAAAA ATACATGATG GATTTAGCTA AACAAATTAT GTACGCCGGT GGAATTGGTA TTGCCTACTG GGAACCAGCT TGGGTAAGCA CCCCTTGCAA AACTCTATGG GGTACAGGTT CTCACTGGGA AAATGCCGTG TTTTTTGACT CTGGCAACAA CAACGAAGCG CTACCCGCGC TTAGTTTCTA CACAGACATA ATGGCTCTTT TTAAGCAAGA TTAA
|
Protein sequence | MLLAQLPVKK YFVLLAIFSF MLGCNSAGVQ QSAKSIQVAG THSKPARFFA GADLSYVNEM EDCGATYRVN GVTTDPYQAF ADAGANLVRV RLWHNPTWTE YSDFADVKKT IRKAKQNNQT VLLDFHYSDT WADPEKQFVP AAWEHMVDDT PALAQALAQY TTDVLEKLQA ENLLPDMVQV GNETNAEVLQ LEAHMKHGEI DWQRNAALLN SGLAAVAEFN QNNNTYIERV LHIAQPENAL WWFDDAAQAG ITDFEIIGLS YYAKWSTYKL DSIGEAIRAL RTAFNKDVLV VETSYPWTMQ NFDQANNVLD ATSLQQGYPA TAEGQKKYMM DLAKQIMYAG GIGIAYWEPA WVSTPCKTLW GTGSHWENAV FFDSGNNNEA LPALSFYTDI MALFKQD
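The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 869047
END_BP = 870240


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length_bp)
    print("Protein length (aa):", protein_length(length_bp))
```

With the coordinates above, the two computed lengths agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields. Passing the Gene sequence string to `gc_fraction` gives a value to compare against the GC content field; the block spacing in the sequence does not need to be removed first.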