Gene Information |
Locus tag | Sde_2929 |
Symbol | |
ID | 3968019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3719272 |
End bp | 3721293 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637922026 |
Product | hypothetical protein |
Protein accession | YP_528398 |
Protein GI | 90022571 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
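The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but it reproduces the listed 2022 bp) and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3719272
END_BP = 3721293


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Under these assumptions the results agree with the Gene Length (2022 bp) and Protein Length (673 aa) fields.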
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATTG GTACTGTTAC GGCTTCAGCA CTGGTTGGTC GAGGCCGTGG CACCCCTAAA AAAATAATCA ACAAGGGTTC TATTATGTGG CAAATCAACA AATCGGCTTT AGCGGCCGTG GTATTAGTGT GTTCCTCATC TAGCTTTGCG CAATCTGCAT GTGACACTCA ACGCATTGAA GCCGAAAATT ACGTGGCAAT GAGTGGTATT CAAACCGAAA GCACGGCAGA CACTGGTGGC GGTTTAAATG TGGGCTGGAT AGACGCCGGC GACTGGCTTA GTTACCAAGT TAACCTACCT GCTGCAGGGC AGTACGAGGT GCGCTATCGC GTTGCCAGTA GAAATGGCGG CGGTGTACTT CGGTTAGAGG GCAATGCCGG TCAAACCTTG TATGGAACTA TGAATGTACC CAACACGGGT GGCTGGCAAA ATTGGCAAAC CCTTTCTCAT TCAGTGACAT TAGCGGCAGG AGAGCAGTCT ATTGGTATTG GTGTGCCAAG CGGCGGGTTT AATATTAATT GGCTGGAGTT CGTACCTTTA GATTGCAGTG GGCCAATCGA CCCGCCCATT AACCCACCTT CGAACTGCGC GAGCATTGTA TTCGAGGCCG AAAATTACGA TCAAATGAGC GGCATTAGAA CGCAAACCAC AAGTGATACC GGAGGCGGCT TAAATGTGGG GTGGATAGAT GCTGGCGACT GGCTTAGCTA TGCCACTGTG AATATCCCCA GCACGCAGGT GTACAATTTT GAATACCGTG TGGCTAGCCC TAATGGCGGC AGTTTTAATT TGCAGGGTTC GGCTGGCGCA GAGAATTTTG ATACCGCTAC TTTGCCCAAT ACGGGTGGTT GGCAAAATTG GACAACGGTA ACAGGCTCGG CGCTTTTACC TGCTGGCAAT GTGAATTTCG GTATTAGTGC GATTACTGGT GGCTGGAATA TAAACTGGTT TAAAGCTACA CCAGAGAGCT GTGATGATAT AAACCCTCCA AGTACCGGTA TTACTGCTAA GCAAGCAGCG GCAGCCATGG GCAAGGGGTT TAATTTGGGG CAAATGTTCG AAAGTACGCA ACACCCAAGA ACATTTAATG CTGCAAAAAG TAAAATAGAT GCTTACTACA ATATGGGCTA CAGAAATGTG CGCATCCCTA TTACTTGGAC TGAAGCCGTA GGCGGAAACA GGCTTGTTGC AGATGCAAAT GTAGGCGCAG TCAATCGCAA CCACTCTCGC TTAGCTGTAA TTACTCAAGT AGTAGATTAC GCGCTTTCGC TACCCGGCAT GTACGTGGTT ATTAATGCGC ATCACGAAGG TGGATTAAAA ACCAATAATC GCTGGTGGGT GTTAGAAACT CTGTGGGCAG ATATTGCCGA TATATTTAAA GACAGAGATC ACCGTTTGCT ATTTGAAATA TTAAACGAGC CACACCTAAG CGATGCCAAT AAGTCGCCTA TGCCCCCCGC CAATTTGCGT TTTATGACGG GCAAAGCCTA TAACAAAATT CGCGCGATAG ATGCGCAGCG AATCGTTATT ATTGGTGGCA ACCAGTGGTT TGGTGCAGGT GAAATGGCAA ACGTATGGCC AAACCTTAAT GATGTTGGCG GCGGTTCCGA TGCATATGTA ATGGCTACTT TTCACCATTA CGACCCGTGG TCGTTTAGTG GCGATAACCA AGGCGATTAC GCCGATGCTT GGACGCTATC TAACGTGGGT AACCCAATGG ATATAATGCA AAGCTGGGCA AACGGCGTAG GCCAAGGTAT GCCTGTGTAT ATTGGCGAGT GGGGCGTAGG TTGGGGCAGC 
CGCTACAGCG CCATGCAGTG CAATAATATT CGCTATTGGT ACCAGCTGTT CGACGCGAGC TATGCCTCGG CAAAAGGCCA GCCTACGGCA GTGTGGGATG ACGGCGGTTG GTTTAAAATA TTCGACCACG GTACCAACAG CTTCAATAAT AATTTAGCCC AATGTATTGG TGGAAACTGC GCTTGGGATG GCGCCGATAG ATTTAATTCT GGCTGTAATT AA
|
Protein sequence | MLIGTVTASA LVGRGRGTPK KIINKGSIMW QINKSALAAV VLVCSSSSFA QSACDTQRIE AENYVAMSGI QTESTADTGG GLNVGWIDAG DWLSYQVNLP AAGQYEVRYR VASRNGGGVL RLEGNAGQTL YGTMNVPNTG GWQNWQTLSH SVTLAAGEQS IGIGVPSGGF NINWLEFVPL DCSGPIDPPI NPPSNCASIV FEAENYDQMS GIRTQTTSDT GGGLNVGWID AGDWLSYATV NIPSTQVYNF EYRVASPNGG SFNLQGSAGA ENFDTATLPN TGGWQNWTTV TGSALLPAGN VNFGISAITG GWNINWFKAT PESCDDINPP STGITAKQAA AAMGKGFNLG QMFESTQHPR TFNAAKSKID AYYNMGYRNV RIPITWTEAV GGNRLVADAN VGAVNRNHSR LAVITQVVDY ALSLPGMYVV INAHHEGGLK TNNRWWVLET LWADIADIFK DRDHRLLFEI LNEPHLSDAN KSPMPPANLR FMTGKAYNKI RAIDAQRIVI IGGNQWFGAG EMANVWPNLN DVGGGSDAYV MATFHHYDPW SFSGDNQGDY ADAWTLSNVG NPMDIMQSWA NGVGQGMPVY IGEWGVGWGS RYSAMQCNNI RYWYQLFDAS YASAKGQPTA VWDDGGWFKI FDHGTNSFNN NLAQCIGGNC AWDGADRFNS GCN
|
| |
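The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the Strand field is "-"), so no reverse complement is applied. Translation table 11 shares its amino acid assignments with the standard code and differs in the permitted start codons; since this gene starts with ATG, the sketch does not handle alternative starts. The name `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# ordered with bases T, C, A, G (third position varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTTGATTG GTACTGTTAC GGCTTCAGCA"))  # MLIGTVTASA
```

Passing the full gene sequence from the record in the same way should yield the 673-residue protein, provided the orientation assumption holds.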