Gene Jann_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1039 
Symbol 
ID3933483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp992039 
End bp993238 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID637903387 
Producthypothetical protein 
Protein accessionYP_508981 
Protein GI89053530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCA GCAGAGAGGG CTCCGGCGCG GCCCGTTTCA CGCGACCCAC GCAGCAGATC 
CTGCTGATGT TGATCATCCT TGCCCTTGTG TTGGCGGGGG GCATTTTGGT GTGGCCCCGG
GTGCAGGATG TGTTCCTGAC CTCCCCCTAT CTCAACGGCA CAATCGGGAT CGTCTTCGTG
GTCGGCGTCT TCGCGACCTT CTTTCAGGTG ACGCAGCTGT TCTCCTCGGT CGCGTGGATT
GAGCATCTGG CGGGCGGATC GAAGACCGAT GAGGATGAGA AACCGCCGCG CCTGCTGGCG
GCGATCTCGG GCGTGGCGCG GTTGCGCGGG TCCCGCACAC AAGTGACGCC TGCCTCTGCG
AAATCCATCC TTGATAGCGT TGGCGCGCGG ATGGAGGAAA GCGGCGACAT CACGCGCTAC
ATCGCCAACC TTCTGATTTT CCTCGGCCTT CTGGGCACGT TCTTTGGCCT TGCAACAACA
GTGCCCGCGG TGGTGGAGAC CATCCGATCC CTGCAACCCA CGGATGGGGA AGAAGGGCTG
GCCGTGTTCG GGCGCCTTAT GGATGGCTTG GACGACCAGC TTGGGGGCAT GGGGACGGCG
TTTGCCTCCT CGCTGCTGGG CCTTGCCGGA TCCCTCGTGA TCGGCCTGTT GGAACTTTAT
GCGGGCCATG GCCAGAACCG GTTTTATCGG GAGTTGGAGG AATGGCTGGC CTCCATCACC
CGCGTATCCT TCTCTGGCGA CGGCGACGGG GCCATCGACA AGGCGGCGAT CGCCACTGTG
CTGGACCATA TGGTTGACCA GATGGACACG CTGCAATCGC TATTTTCCCA GTCCGAGACG
CGTCGCGCGG CCACCGAACA GCGCGTTCTG ACCCTGGCGC AGAGCATTGA GGGCCTGACT
GATCGCCTTG GTCCGGGGCA GGTGGCGGCG GTTGAACGGC TGGCCACGGC ACAAGATCGT
CTTGCCGCAG CGCTGGACGG TGTGGCTGCG GAGCAGGGGC TTGATGACGA ATCCCGCAAC
AGGTTGCGGT CCATTGACGT GCAATTGTTC AAAATGGCTG AAGAGATCGG CACCACCCGC
GACGCCGAGG TTATGGGGCT GCGCGGGGAT CTGGCGCATC TGACCGAAGC CTTGCAAGAA
CTGACCCGCG CCGCTCGCGC CCCGGCGCAG GCCCGGGTGC GCCAACGCGG GGACAGCTAG
 
Protein sequence
MASSREGSGA ARFTRPTQQI LLMLIILALV LAGGILVWPR VQDVFLTSPY LNGTIGIVFV 
VGVFATFFQV TQLFSSVAWI EHLAGGSKTD EDEKPPRLLA AISGVARLRG SRTQVTPASA
KSILDSVGAR MEESGDITRY IANLLIFLGL LGTFFGLATT VPAVVETIRS LQPTDGEEGL
AVFGRLMDGL DDQLGGMGTA FASSLLGLAG SLVIGLLELY AGHGQNRFYR ELEEWLASIT
RVSFSGDGDG AIDKAAIATV LDHMVDQMDT LQSLFSQSET RRAATEQRVL TLAQSIEGLT
DRLGPGQVAA VERLATAQDR LAAALDGVAA EQGLDDESRN RLRSIDVQLF KMAEEIGTTR
DAEVMGLRGD LAHLTEALQE LTRAARAPAQ ARVRQRGDS