Gene Jann_3450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3450 
Symbol 
ID3935924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3497884 
End bp3499890 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content61% 
IMG OID637905824 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_511392 
Protein GI89055941 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.472971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAG ATACCGACGA TCGTAAGCAG GACGGCGACG AGACCGAAAT GGGCCTGGAT 
ATGAGCCAGG CCGCCGTCAA GAAAATGATC GCGGAGGCTC GGGTCAGAGG CTACATCACC
TACGATCAGC TCAACACCGT TCTGCCGCCG GAGCAGGTCA GCTCGGAGCA GATCGAAGAC
GTCATGTCGA TGCTGTCGGA GATGGGCATC AACATCATCG AGGATGATGA GGCCGAGGAA
GGCGACGACA AACCCGCCGG CGAATTGGTG ACCACCGAAG GCTCCCGCGA GGTCGCGGTT
GCAGCCACGG AGACCGAGAA ACTCGACCGC ACCGATGACC CCGTCCGCAT GTATCTGCGC
GAAATGGGCA GTGTGGAACT GCTGTCGCGT GAGGGCGAGA TTGCCATCGC CAAGCGGATC
GAGGCCGGGC GCAACACGAT GATCGCGGGC CTGTGCGAAA GCCCGCTGAC CTTCCAGGCG
ATCACCATCT GGCGTGACGA ACTTCTGGAA GAAGACATCC TGCTGCGTGA TGTCATCGAC
CTCGACACCA CCTTCGGGCG CACGATGGGC GATGAAAACG ACACCCCGGT CGTGCCCGCA
GGCGTCGGCT CCACGCCGCC ATCCGCCACG CCCGCCCCCC CGCCGGAGCC AAAGAAAGAA
GAGCCTACGC AGGAACTGGA CGCCGACGGC AACCCCATCG CCAAGGACGA CGATGAGGAT
GAGGACGAGC AGGCCAACAT GTCGCTCGCC GCGATGGAAC TGGCGCTGAA GCCCCAGGTC
CTCGCCACGC TTGACCAGAT CGCCAACGAT TACATCCGCC TGTCGGAAAT GCAGGACAGC
CGGATTTCGG CGACCCTGAA CGAAGATGAC AGCTTCTCCA AGGCAGAGGA GGCCGAATAC
CAGCACCTGC GCTCCGAGAT CGTGGAACTG GTCAATGAGC TGCACCTGCA TAACAACCGC
ATCGAGGCGC TGGTGGACCA GCTTTACGGC ATCAACCGCC GGATCATGTC CATCGATTCC
AGCATGGTGA AGCTGGCCGA TCAGGCCCGC ATCAACCGCC GCGAGTTCAT CGAAGAATAC
CGCAACCAGG AACTTGATCC GACCTGGATG GAGCGGATGA CGCAAAAGTC CGGCCGCGGC
TGGCAGGCGC TGATGGAGCG CTCGTCGGGC AAGATCGAGG AATTGCGCGG GGATATGGCG
CAGGTCGGTC AGTATGTGGG CCTCGACATC CCCGAATTCC GCCGCATCGT GAACCAGGTC
CAGAAGGGCG AGAAAGAAGC CCGCCAGGCC AAGAAGGAAA TGGTCGAAGC CAACCTGCGT
CTGGTGATTT CCATCGCCAA GAAATACACC AACCGGGGCC TGCAATTCCT TGATCTCATT
CAGGAAGGTA ACATCGGCCT GATGAAGGCC GTCGACAAGT TCGAATACCG CCGGGGCTAC
AAGTTCTCCA CCTATGCGAC GTGGTGGATC CGTCAGGCGA TCACCCGCTC CATCGCCGAT
CAGGCCCGCA CGATCCGCAT CCCGGTCCAC ATGATCGAGA CGATCAACAA GCTGGTCCGC
ACCGGTCGCC AGATGCTGCA CGAAATCGGC CGGGAGCCGA CGCCGGAGGA ATTGGCCGAG
AAGCTGCAAA TGCCGCTGGA GAAGGTCCGC AAGGTGATGA AGATCGCCAA GGAGCCGATC
AGCCTTGAGA CGCCCATCGG CGACGAGGAG GATTCACAGC TTGGCGATTT CATCGAGGAC
AAGAACGCCG TCCTGCCGCT GGATTCAGCG ATCCAGGAGA ACCTCAAAGA AACCACAACC
CGGGTTCTGG CCTCCCTCAC CCCCCGCGAA GAACGCGTCC TGCGCATGCG CTTCGGCATC
GGCATGAACA CCGACCACAC GCTGGAAGAA GTCGGCCAAC AGTTCAGCGT GACCCGCGAA
CGGATCAGGC AGATCGAGGC GAAGGCGCTG CGGAAGCTGA AGCACCCCAG CCGGTCAAGG
AAGCTGCGGT CGTTCCTGGA TCAGTGA
 
Protein sequence
MAKDTDDRKQ DGDETEMGLD MSQAAVKKMI AEARVRGYIT YDQLNTVLPP EQVSSEQIED 
VMSMLSEMGI NIIEDDEAEE GDDKPAGELV TTEGSREVAV AATETEKLDR TDDPVRMYLR
EMGSVELLSR EGEIAIAKRI EAGRNTMIAG LCESPLTFQA ITIWRDELLE EDILLRDVID
LDTTFGRTMG DENDTPVVPA GVGSTPPSAT PAPPPEPKKE EPTQELDADG NPIAKDDDED
EDEQANMSLA AMELALKPQV LATLDQIAND YIRLSEMQDS RISATLNEDD SFSKAEEAEY
QHLRSEIVEL VNELHLHNNR IEALVDQLYG INRRIMSIDS SMVKLADQAR INRREFIEEY
RNQELDPTWM ERMTQKSGRG WQALMERSSG KIEELRGDMA QVGQYVGLDI PEFRRIVNQV
QKGEKEARQA KKEMVEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY
KFSTYATWWI RQAITRSIAD QARTIRIPVH MIETINKLVR TGRQMLHEIG REPTPEELAE
KLQMPLEKVR KVMKIAKEPI SLETPIGDEE DSQLGDFIED KNAVLPLDSA IQENLKETTT
RVLASLTPRE ERVLRMRFGI GMNTDHTLEE VGQQFSVTRE RIRQIEAKAL RKLKHPSRSR
KLRSFLDQ