Gene Jann_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2023 
Symbol 
ID3934476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2026530 
End bp2028680 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content60% 
IMG OID637904379 
Producthypothetical protein 
Protein accessionYP_509965 
Protein GI89054514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.106558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCAG CGCTCATAAT CGCCCGGCGC ACACTTGGTG CGATCGTCTT GGCGACCGCC 
GCGGGCGCAG CCCATGCGCA ATCTGACTCT GCGGGCGAGC CCGCCTGGGT GTATGGCGGG
GCACCTGAGG GCGTACGCTT GGCTTGGCTG GATGAAACTG CCACGGAAGT CCTTTCAATC
GGATGCGCCT CCGAGCAAGG GGGCTCGCGC GTAAATGCGG ATATGACCCT AGGACCGGCC
ATCGCGCCGT CCGCCAATCC GATCCTCTTT CAAACTTCGG AACAAAGCCT GGTTTTCCGT
CCTCCGCAGT TCTTCAACGG AGAACGCGGC CCGCTTCGGA TACAATTCCC ACGCCGGTTC
ATGGCGAACC TCAGACAAGT CGGGGAGGAC GCGCAGCTCA CCCTTGCGAC CGATCAGGCT
CAGGACCTGC AGATTGACCT CGCCCCAGCT ATCCCGTTGA TCGAACGGCT GGTTGGGCAT
TGCCAGCGCG GGGCGGGTGA CGGGTTGCCG TCTTTTGTCT GCACCAATGC GGCCACCGCA
ACGGATGGGG CGGTCTGTCG GGATCCGGAG GCATCAACTC TGGATCGCAC CGTAGCCGAC
ACTTATTTTG CGGCCTCGAG TGCGCTGTCA GGCAATGAAA GAATCCTTGC GCACGGACAA
AACGAGAACT GGCACGCTGG CCGCCTGACT TGTGAACAGG ACCTCGCATG TATTGCAGCG
CAGGCTGAGG CATTTCTCGC CTCCCTTGAG GCTATTGCGC CTTCTTCCGC TCCGACGTCC
ATGCCCGAAA ACCCCGCCTT TGCCGCCTGG CGCACCAACC CCGATCTGGC GCGCGCAGAG
GCCGCCACGC ACGGGGTCGA CAACGCCATC TTGCTCGATA TCCGAGGCAA TACCGCCTTC
CTCGCAAGTA GATCTGTCAC GTCGGGCACA TGGATAGCCA TACACGCCGT AGATGCGCAG
ACAGTTGTCC CGGAAGGCCC GGACCGATCA ACGATTATCG CGGAACTGAC GCGCAACGCA
GGACTTCAGT TTCCTGAAAT ATCACTGTTT CACGTGCGTC TGGGGCATCC GCTTCCGGAT
GGTTTGGGAG GACGCGGCAC GTTCGGCACA GCCGTAGTTC AGGAGAATTT TCGACCCTCC
ATCGCCGGAA ACCACTCTGC GAATAGACGA CTAAGCTGGA GTCCCGGCTA CGCCGATGCG
CCGCGCAGTT GGGCGGATGC TGAAGCTGAA CTATTGGGGC GCGAAGCGGC CAATGAAGCT
GCACGACAAG AGGCTTTGGC TGCAGCAGCC ATTGCAAGAC AGGCTGCACA AGATGCTGCA
CGCGCCAGCG CGAGCGAAAT CATCGATAGT GGTCAGGTCT ACCTTTCGCC GGGTTTCTGG
ACTGATTTCT ACGATCCATC AGTTATTCGT TCGATTTTCC ACGGGACTGC GACTTACTCG
CCGGAGGCCC CGGCTCTCAT CCGCTCTACG GCGGGCTATG TGATGGCGTA CAGCGCGCGC
TGCAGTTCAT ATCTTCCCGC CGATGCTGTT GTTTTCACGG ACGTCGAGAC CACCCGGATC
TTGAACGGGT TTGGGGTGCA ACTGCATGCT TCGGAAAATA TCCGCCAGTT GTCTGTTCAC
CCGCGTCTGG CCGGTGTGAT GGCACAGCAC CGCAGGTCAG AGTCTGGCGG GGGAGGCCTA
CAGCAGGGCT TGGTTCTTGC GGACGGGGTG TTCTCGGGTC GCATATCGTT GGGTGATGCG
ATCAATCAGG TTGCGGGCTT TGCCATAGAT GCACAGCGAC TAGTGACAAG CTTCCCATGT
GACAGCGCGC CGGTCCGGCA GTTGTTCGAA AATCTGGTCG CGCTGACAGG CGACACACCC
ACTTTGGCAC GAACAGGTAT CGGCGTTGAG GGCGCGGCTG CATACAGCGA TCCGGTTCCC
GGAGCCGCGC CGTTCATCAG GGTCTCCGAT GCGTGCATGT ACTGGGCGAT CGACGCAACG
CTCTCCGCGC CGTTCTGCCG ATGCCTGGAA AACCGGCTTG TCGCCCCTGA ACGGGAGGTG
GCTTTGTCCG ATTTCGCATC CTTTATGCGA GACTTTGGGC TTGAAGGTTC GCAAGCGGAT
ACGCCCGGCG CGCGCAGACA ACTGGCTTGT ATCCGAGAAA CAGCTCGGTA A
 
Protein sequence
MNSALIIARR TLGAIVLATA AGAAHAQSDS AGEPAWVYGG APEGVRLAWL DETATEVLSI 
GCASEQGGSR VNADMTLGPA IAPSANPILF QTSEQSLVFR PPQFFNGERG PLRIQFPRRF
MANLRQVGED AQLTLATDQA QDLQIDLAPA IPLIERLVGH CQRGAGDGLP SFVCTNAATA
TDGAVCRDPE ASTLDRTVAD TYFAASSALS GNERILAHGQ NENWHAGRLT CEQDLACIAA
QAEAFLASLE AIAPSSAPTS MPENPAFAAW RTNPDLARAE AATHGVDNAI LLDIRGNTAF
LASRSVTSGT WIAIHAVDAQ TVVPEGPDRS TIIAELTRNA GLQFPEISLF HVRLGHPLPD
GLGGRGTFGT AVVQENFRPS IAGNHSANRR LSWSPGYADA PRSWADAEAE LLGREAANEA
ARQEALAAAA IARQAAQDAA RASASEIIDS GQVYLSPGFW TDFYDPSVIR SIFHGTATYS
PEAPALIRST AGYVMAYSAR CSSYLPADAV VFTDVETTRI LNGFGVQLHA SENIRQLSVH
PRLAGVMAQH RRSESGGGGL QQGLVLADGV FSGRISLGDA INQVAGFAID AQRLVTSFPC
DSAPVRQLFE NLVALTGDTP TLARTGIGVE GAAAYSDPVP GAAPFIRVSD ACMYWAIDAT
LSAPFCRCLE NRLVAPEREV ALSDFASFMR DFGLEGSQAD TPGARRQLAC IRETAR