Gene Jann_4239 details

Gene Information

Locus tag: Jann_4239
Symbol:
ID: 3932391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007801
Strand:
Start bp: 26860
End bp: 28413
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 55%
IMG OID: 637902296
Product: lipopolysaccharide biosynthesis
Protein accession: YP_512179
Protein GI: 89057725
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
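
The coordinate and length fields in this record agree with one another, which a short calculation can confirm. The sketch below is illustrative only and is not part of the database record. The function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(26860, 28413)
print(length_bp)                  # 1554
print(protein_length(length_bp))  # 517
```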


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0553335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTCG ACCTGCGTTT CTATCTGCGC CTGTTCATCC GACGACTGCC GGCCATGATG 
ACGATCATTA TCGTCTGCTC AGCCATCGGC GTCGTGCTGG CGATGCGCCT GCCGCCGACT
TATTCGACGT CTGCACGACT TTTGATGGAA AGTCCGCAAA TTCCTGACGC ACTGGCCGCT
GCGATCGTCA ATGTATCCCC TGATGAAGAG ATCGAAATCA TCCGTCAGAG GCTGCTGACC
CGGGTCAATC TGCTCAACAT TGCCAATGAA ATGAATGTGT TTGAGAACTA CGCCGCCACG
CCAGCGGACG AGATGGTGCA ACACATGCGT GCGCATACCA GAATCCGCAG CACCGGAGGC
CAGAATCAAC CCACTCTGAT TACGGTTAGC TTCGAGGCAC GCGCGCCCCA GATCGCAGCC
AATGTAGTTA ATGAATATGT CACCCGCATC GTCAGCGCGA ATGTAGAATT GCGCACTGGA
TTGGCTGAAG ACACATTGTC ATTCTTTGAG CAGGAAGTCA GCCGCCTCTC CACAGAATTG
GATCTGCGGA GTGGGTACAT CGCAGAGTTC CAGATAGAAA ATGCCGACGC CCTACCGGAC
GATCAAGCGT TCCGGCTAAA CCGTCAGGCG CTCTTGCAAG AGCGTATCGG GAGTGCCGAA
CGCGAACGTG CAACCCTTCA GGATCAGCGC GCCCGTGTGA TTGCGATTTT TGAGCAAACC
GGTCAAATCA ACTCAGGCAA CCCCGAAACG CTCAGCCCAG AAGAACAGCA ACTGCATGCG
GCTGAAGCGG AGTTGGCCAA CGCGCTTACA ATTTTTTCCG AAACCAACCC TCAAGTGCAA
TTGCTTCAGC GCCAGGTTGA GCGATTGCGC GAACAGATAG CAGCCGGAAT AGAAACAGGT
CAGGAAGAGG GAGAAAACGA CGACGCGCGC TCCAGCACAC GCACCGTTCT GGATCTACAA
CTGACCCAAA TTGACACTCA AATCACCGCA CTCGACACTC TGATCGCCGA GACACAGCAA
GAACTACAGC GGTTGGAGGA TGCCATTTCA CGCACTCCGA GCAATGCGAT CACACTCAGT
AGCCTAGAGC GCAACTACGA AAATATCCGC GATCAGTATG ATCAGGCCGT TGCGCGGTTG
GCCGATGCCA GCACAGGTGA GCGGATTGAA CTGACCTCCC GGGGGCAGCG GATATCCTTG
ATTGAATCTG CCAACGTACC CAGAAGCCCA TCTAGTCCAG ATCGCCCAAT GGTCGCCGCC
ACTGGTGTCG CGTTAGGGAT CGGTTTGGCT GGCGCTCTGT TCTTGTTGCT TGAACTCCTG
AACCGCACCG TGCGACGCCC GGTGGAGATA ACAAATGCGC TTGGCATCCA ACCTCTGGCC
GTAATCCCTT ACCTAGACAC ACAAAGCCTC AGACTAAGAC GTCGATTGAT ACGCTTGATA
TCGCTGATAA TTGTGATTTT GGGCGTACCG GCCGCCCTCT GGGCGATCGA TATGTATTAT
ATGCCTCTTG ACCAATTGGC TGAGCGGGTC CTGAGTCGTC TAGGCGTGGG TTGA
 
Protein sequence
MNFDLRFYLR LFIRRLPAMM TIIIVCSAIG VVLAMRLPPT YSTSARLLME SPQIPDALAA 
AIVNVSPDEE IEIIRQRLLT RVNLLNIANE MNVFENYAAT PADEMVQHMR AHTRIRSTGG
QNQPTLITVS FEARAPQIAA NVVNEYVTRI VSANVELRTG LAEDTLSFFE QEVSRLSTEL
DLRSGYIAEF QIENADALPD DQAFRLNRQA LLQERIGSAE RERATLQDQR ARVIAIFEQT
GQINSGNPET LSPEEQQLHA AEAELANALT IFSETNPQVQ LLQRQVERLR EQIAAGIETG
QEEGENDDAR SSTRTVLDLQ LTQIDTQITA LDTLIAETQQ ELQRLEDAIS RTPSNAITLS
SLERNYENIR DQYDQAVARL ADASTGERIE LTSRGQRISL IESANVPRSP SSPDRPMVAA
TGVALGIGLA GALFLLLELL NRTVRRPVEI TNALGIQPLA VIPYLDTQSL RLRRRLIRLI
SLIIVILGVP AALWAIDMYY MPLDQLAERV LSRLGVG
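
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they could be cross-checked, the code below strips that spacing, computes GC content, and translates the nucleotide sequence. The helper names are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAACTTCG ACCTGCGTTT CTATCTGCGC CTGTTCATCC GACGACTGCC GGCCATGATG"
print(translate(first_line))  # MNFDLRFYLRLFIRRLPAMM
```

The printed result matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein listing and the GC content field.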