Gene Jann_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3420 
Symbol 
ID3935894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3467291 
End bp3469342 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content68% 
IMG OID637905794 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_511362 
Protein GI89055911 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTCG CAGGCGATAG CGGCACGGAG CCTTCCGAGG CCCCGTCCGC GCCCCGCCGC 
CGTGCCTATC ACTACAATGC AGGTTTCCTG ACCAACACGC GCGTGCGCCG CATCCTGGCG
TTGGCGGGCT ATGACTTGAA GCTCGGCACA CCGGACGCGG CGGATGACGT CATCGTCTGG
GGCCATTCCC CCTATGCGCC GCGCGGTGAG GCTGTGGCCG ACAGCACCGG CGCCCATCTG
GTGCGGGTGG AGGATGCGTT CCTGCGCTCC CTCCGTCCGG GCCGTTCCGG CGAGCCGCCT
CTGGGCCTCG TGATCGACCG GCGCGGCATG TATTTCGATG CCACGCGCGC CTCGGATCTG
GAACATATCC TGGCCACGCA CCCCTTTGAT GACACGGCGC TCTTGAACCG CGCCCGCGAT
GTCATGGCCC GGATGGCCGA GGGGCATCTG TCGAAATACG CCGCGACGGA CCCGGCACTG
GATCCGCCCG CGCCCGGTTA CGTTCTGCTG ATCGACCAGA CCAAAGGCGA CGCCTCCATC
CAGCTCGGTC AAGCCACGCC CGACAGCTTT GCCGAGGCGC TGACCTGGGC CCGCGAAGAC
CATCCCGACG CCCATATCGT CGTCAAGACC CACCCCGAAA CCCGCGACGG CCACCGCCCC
GGTCACTTTG ACCCGGACGG ACTGCCGCCC AATGTCTCTC TCGATGATCG CCCGATCAGC
CTTTGGCGCA TGTTTGAAGG CGCGCGGGCC GTTTATACCG TGACCTCCCA AGCGGGGTTC
GAGGCGATCC TGGCGGGCCA CAAGCCAGTC ACCTTCGGCG TGCCGTTCTA TGCCGGATGG
GGTCTGACCG ATGACCGCCG CCCGGTGCCC GTCCGTCGCC AACGGGTGCT GACCCGCGCG
CAATTGGTCG CCGGGGCGCT GTTGCTCTAC CCCACATGGT ACGACCCCTA CCGCGACGGG
TTGGGAGAGG TCGAAGACAC CCTCGGCGCG TTGGAGGCCC AGGCCCGCAG CTGGCGAGAG
GACCGCGCGG GCTACACCGC CATCGGGATG AGCCGTTGGA AAAGGGGGCA TTTGCGGGCC
GGGTTCGGCC AACACGGGCC GCTGGACTTC GCCGATCAGC CTGTCGCAGG GCGGCCGACG
CTGGTCTGGG CCGGGAAGGA AACGTCTGAG CTTCAGGCCG CCTGCGGTGA CGCGCACCTG
CTGCGGATGG AAGACGGGTT CCTGCGGTCG CGCGGTCTGG GGGCCGATCT TGTGCCGCCC
CTGTCCCTCG TTCTCGACGA CCTCGGCATC TACTACGACC CCACGCGCGA GAGCCGGTTG
GAGCGGTTGA TTGCAGAGGC CGCAGCCCTT CCTCCCGCAC GCCTGGACCG GGCGGAGCGT
CTTATCCAGA CCCTGCGTCG AACGGGTCTG ACCAAGTACA ATCTGCCCGG CGGGGCGCTG
CCGGATATTC CCCCGGATCG ACCATTGGTC CTTATCCCGG GACAGGTCGA AGATGACGCC
TCCATCCGAC TCGGCGCAGG CGCGATCACC ACCAACGCCG CGCTGTTGGC CGAGGCGCGC
AGGCTCCACC CGGGCGCCTA TCTCATCTAC AAGCCCCACC CCGATGTGGA GGCCGGATTG
CGCATCGGGG TCCTTCCGGA GGAGGCGCGC CACCTGGCCG ATCATATCGC CGAGACGACT
GGGGCGGAGG CGCTGTTGGC CCTCTCCCCC CACGTCATCA CCATGACCTC CGCTATGGGA
TTTGAAGCGC TGATCCGGGG CCTTCCCGTC ACGACCCTCG GCGCGCCGTT CTATGCCGGA
TGGGGCCTGA CGACGGACCT CGGTGACGTG CCCCCGCGCC GCACGGCCCG CCCCTCCCTG
GCCGCGCTCG TCCACGCCGC GCTGATCACC TATCCCCGCT ACATGGACCC TCGCACCGGC
CTGCCCTGCC CCGTCGAAAT CGCGGTCGAG CGGCTGTCAT CCGGCGAGGG TATGCCGTCC
AAACCGATCC TGCGTATCGT GGCAAAATTG CAAATCTGGC TGTCCGGTCA ATCCCGGCTC
TGGCGTCGGT AG
 
Protein sequence
MALAGDSGTE PSEAPSAPRR RAYHYNAGFL TNTRVRRILA LAGYDLKLGT PDAADDVIVW 
GHSPYAPRGE AVADSTGAHL VRVEDAFLRS LRPGRSGEPP LGLVIDRRGM YFDATRASDL
EHILATHPFD DTALLNRARD VMARMAEGHL SKYAATDPAL DPPAPGYVLL IDQTKGDASI
QLGQATPDSF AEALTWARED HPDAHIVVKT HPETRDGHRP GHFDPDGLPP NVSLDDRPIS
LWRMFEGARA VYTVTSQAGF EAILAGHKPV TFGVPFYAGW GLTDDRRPVP VRRQRVLTRA
QLVAGALLLY PTWYDPYRDG LGEVEDTLGA LEAQARSWRE DRAGYTAIGM SRWKRGHLRA
GFGQHGPLDF ADQPVAGRPT LVWAGKETSE LQAACGDAHL LRMEDGFLRS RGLGADLVPP
LSLVLDDLGI YYDPTRESRL ERLIAEAAAL PPARLDRAER LIQTLRRTGL TKYNLPGGAL
PDIPPDRPLV LIPGQVEDDA SIRLGAGAIT TNAALLAEAR RLHPGAYLIY KPHPDVEAGL
RIGVLPEEAR HLADHIAETT GAEALLALSP HVITMTSAMG FEALIRGLPV TTLGAPFYAG
WGLTTDLGDV PPRRTARPSL AALVHAALIT YPRYMDPRTG LPCPVEIAVE RLSSGEGMPS
KPILRIVAKL QIWLSGQSRL WRR