Gene Information |
Locus tag | Jann_4098 |
Symbol | |
ID | 3936587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 4206920 |
End bp | 4208629 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637906484 |
Product | extracellular solute-binding protein |
Protein accession | YP_512040 |
Protein GI | 89056589 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
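The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(4206920, 4208629)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1710 bp) and Protein Length (569 aa). The strand field does not affect the length calculation, only the orientation of the reported sequence.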
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00991271 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.791743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTAA CCCTTAGAAC AACCTCGGCC CTGGCGCTGA CCGTGGGGCT TCTGGCCACT CCGGCCATCG CCGATATGGA AGCGGCCATC GCGTTCCTGG ACGAGCATAT CGAGCATTCC GCGCTGACCC GCGAAGAGCA GGAAGCCGAG ATGCAATGGT TCGTCGACGC GGCCCAACCC TATCAGGGTA TGGAGATCCG CGTTGTGTCA GAGACCATCG CCACCCACGA ATATGAGGCC AATGTGCTGG CCCCCATCTT CGAGGCGATC ACCGGCATCA GCGTCACCCA CGACCTGATC GGCGAAGGCG ACGTGGTGGA GCGTCTGCAA ACGCAGATGC AGACCGGCGA AAACATCTAT GACGCCTACG TCAACGACAG TGATCTGATC GGCACCCACT GGCGCTATCA GCAGGCCCGC AACCTGACGG ACTGGATGGC GAATGAGGGC GCGGACGTTA CCAACCCCAA TCTGAACCTT GACGACTTCA TCGGCCTGTC GTTCACGACG GGTCCCGATG GGCTGCTGTA CCAGCTGCCC GACCAGCAGT TCGCGAACCT CTATTGGTTC CGCTACGACT GGTTCACCGA TCCAGAAATC ATGGCCGATT TCCAGGAGCA ATACGGCTAT GAGCTGGGTG TTCCGGTCAA CTGGTCTGCC TATGAGGATA TCGCGGAATT CTTCACCGGT CGTGAAATCG ACGGTGTCGA GGTCTTCGGT CACATGGACT ACGGTCGTCG CGATCCGTCG CTGGGTTGGC GTTTCACCGA TGCGTGGATG TCCATGGCTG GCATGGGCGA CATTGGCGAG CCGAACGGCC TGCCCGTCGA TGAGTGGGGC ATTCGTGTGA ACGAAGACAG CCGTCCCGTC GGCTCTTGCG TCGCGCGTGG CGGTGCTACC AACGGCCCGG CCGCAGTCTA CGCCATTGAG TCGTACACCA ACTGGCTGAC CAACTACGCA CCGCCGGAAG CGGCTGGTAT GAACTTCTCT GAGGCGGGGC CACTGCCGTC GCAGGGTGTG ATTGCGCAGC AGATGTTCTG GTACACGGCG TTTACCGCGT CGATGGTCGG CGAGGGTGCG GAAGCGGTGC TGAACGACGA CGGGTCGCCC CGTTGGCGGA TGGCCCCCAG CCCGCACGGT GTCTACTGGC GCGAAGGTCA GAAGATCGGC TACCAGGACG CGGGGTCCTG GACGTTGATG CAGTCCACAC CCGTGGACCG GGCGCAGGCC GCATGGCTTT ATGCTCAGTT CGTGACGTCG ATGACCGTGG ATGTCGAGAA GTCCCATGAG GGCCTCACGT TTATCCGCGA GTCCACGATC CAGCACGAGA GCTTCACCGA GCGTGCGCCA AACCTGGGGG GTCTGATCGA GTTCTATCGC TCGCCCGCCC GCACCCAGTG GTCGCCAACT GGTACGAACG TGCCTGATTA TCCACGTCTG GCGCCGCTGT GGTGGCAGAA CATCGGCGAT GCATCGTCCG GTGCACTGAC CCCGCAAGAG GCGCTCGATA ACCTTTGTGC GCAGCAAGAG GCCGTTCTGG CCCGTCTTGA GCGGGCAGGC GTTCAGGGGG ATCTCGGTCC GCTTCTCAAC GATGAAAGCG ATCCGGAGTT CTGGCTGTCT CAGGACGGTG CGCCCCAGGC CGCCCTTGAG AACGAGGACG AAGAGCCACA AACCGTCAGC TATGACGAGC TGATCCAATC CTGGCAGTAA
|
Protein sequence | MNLTLRTTSA LALTVGLLAT PAIADMEAAI AFLDEHIEHS ALTREEQEAE MQWFVDAAQP YQGMEIRVVS ETIATHEYEA NVLAPIFEAI TGISVTHDLI GEGDVVERLQ TQMQTGENIY DAYVNDSDLI GTHWRYQQAR NLTDWMANEG ADVTNPNLNL DDFIGLSFTT GPDGLLYQLP DQQFANLYWF RYDWFTDPEI MADFQEQYGY ELGVPVNWSA YEDIAEFFTG REIDGVEVFG HMDYGRRDPS LGWRFTDAWM SMAGMGDIGE PNGLPVDEWG IRVNEDSRPV GSCVARGGAT NGPAAVYAIE SYTNWLTNYA PPEAAGMNFS EAGPLPSQGV IAQQMFWYTA FTASMVGEGA EAVLNDDGSP RWRMAPSPHG VYWREGQKIG YQDAGSWTLM QSTPVDRAQA AWLYAQFVTS MTVDVEKSHE GLTFIRESTI QHESFTERAP NLGGLIEFYR SPARTQWSPT GTNVPDYPRL APLWWQNIGD ASSGALTPQE ALDNLCAQQE AVLARLERAG VQGDLGPLLN DESDPEFWLS QDGAPQAALE NEDEEPQTVS YDELIQSWQ