Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_4239 |
Symbol | |
ID | 3932391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007801 |
Strand | + |
Start bp | 26860 |
End bp | 28413 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637902296 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_512179 |
Protein GI | 89057725 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0553335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTCG ACCTGCGTTT CTATCTGCGC CTGTTCATCC GACGACTGCC GGCCATGATG ACGATCATTA TCGTCTGCTC AGCCATCGGC GTCGTGCTGG CGATGCGCCT GCCGCCGACT TATTCGACGT CTGCACGACT TTTGATGGAA AGTCCGCAAA TTCCTGACGC ACTGGCCGCT GCGATCGTCA ATGTATCCCC TGATGAAGAG ATCGAAATCA TCCGTCAGAG GCTGCTGACC CGGGTCAATC TGCTCAACAT TGCCAATGAA ATGAATGTGT TTGAGAACTA CGCCGCCACG CCAGCGGACG AGATGGTGCA ACACATGCGT GCGCATACCA GAATCCGCAG CACCGGAGGC CAGAATCAAC CCACTCTGAT TACGGTTAGC TTCGAGGCAC GCGCGCCCCA GATCGCAGCC AATGTAGTTA ATGAATATGT CACCCGCATC GTCAGCGCGA ATGTAGAATT GCGCACTGGA TTGGCTGAAG ACACATTGTC ATTCTTTGAG CAGGAAGTCA GCCGCCTCTC CACAGAATTG GATCTGCGGA GTGGGTACAT CGCAGAGTTC CAGATAGAAA ATGCCGACGC CCTACCGGAC GATCAAGCGT TCCGGCTAAA CCGTCAGGCG CTCTTGCAAG AGCGTATCGG GAGTGCCGAA CGCGAACGTG CAACCCTTCA GGATCAGCGC GCCCGTGTGA TTGCGATTTT TGAGCAAACC GGTCAAATCA ACTCAGGCAA CCCCGAAACG CTCAGCCCAG AAGAACAGCA ACTGCATGCG GCTGAAGCGG AGTTGGCCAA CGCGCTTACA ATTTTTTCCG AAACCAACCC TCAAGTGCAA TTGCTTCAGC GCCAGGTTGA GCGATTGCGC GAACAGATAG CAGCCGGAAT AGAAACAGGT CAGGAAGAGG GAGAAAACGA CGACGCGCGC TCCAGCACAC GCACCGTTCT GGATCTACAA CTGACCCAAA TTGACACTCA AATCACCGCA CTCGACACTC TGATCGCCGA GACACAGCAA GAACTACAGC GGTTGGAGGA TGCCATTTCA CGCACTCCGA GCAATGCGAT CACACTCAGT AGCCTAGAGC GCAACTACGA AAATATCCGC GATCAGTATG ATCAGGCCGT TGCGCGGTTG GCCGATGCCA GCACAGGTGA GCGGATTGAA CTGACCTCCC GGGGGCAGCG GATATCCTTG ATTGAATCTG CCAACGTACC CAGAAGCCCA TCTAGTCCAG ATCGCCCAAT GGTCGCCGCC ACTGGTGTCG CGTTAGGGAT CGGTTTGGCT GGCGCTCTGT TCTTGTTGCT TGAACTCCTG AACCGCACCG TGCGACGCCC GGTGGAGATA ACAAATGCGC TTGGCATCCA ACCTCTGGCC GTAATCCCTT ACCTAGACAC ACAAAGCCTC AGACTAAGAC GTCGATTGAT ACGCTTGATA TCGCTGATAA TTGTGATTTT GGGCGTACCG GCCGCCCTCT GGGCGATCGA TATGTATTAT ATGCCTCTTG ACCAATTGGC TGAGCGGGTC CTGAGTCGTC TAGGCGTGGG TTGA
|
Protein sequence | MNFDLRFYLR LFIRRLPAMM TIIIVCSAIG VVLAMRLPPT YSTSARLLME SPQIPDALAA AIVNVSPDEE IEIIRQRLLT RVNLLNIANE MNVFENYAAT PADEMVQHMR AHTRIRSTGG QNQPTLITVS FEARAPQIAA NVVNEYVTRI VSANVELRTG LAEDTLSFFE QEVSRLSTEL DLRSGYIAEF QIENADALPD DQAFRLNRQA LLQERIGSAE RERATLQDQR ARVIAIFEQT GQINSGNPET LSPEEQQLHA AEAELANALT IFSETNPQVQ LLQRQVERLR EQIAAGIETG QEEGENDDAR SSTRTVLDLQ LTQIDTQITA LDTLIAETQQ ELQRLEDAIS RTPSNAITLS SLERNYENIR DQYDQAVARL ADASTGERIE LTSRGQRISL IESANVPRSP SSPDRPMVAA TGVALGIGLA GALFLLLELL NRTVRRPVEI TNALGIQPLA VIPYLDTQSL RLRRRLIRLI SLIIVILGVP AALWAIDMYY MPLDQLAERV LSRLGVG
|
| |