Gene Jann_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_2004 
Symbol 
ID3934457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp2007963 
End bp2011463 
Gene Length3501 bp 
Protein Length1166 aa 
Translation table11 
GC content47% 
IMG OID637904360 
Producthypothetical protein 
Protein accessionYP_509946 
Protein GI89054495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.731586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAAAC CGTCAGTAGA GAGCTCTATT GTCCGAATTA TTCGCATCAG TGACCAGCAG 
TGGGTAGGCT GCGGATTCTA TGCGAAATTT GGCGACCTGA GCGTGCCGCC CCTAGTTGTT
ACTTGCGCAC ACGTAGTTGC CGATGCGATC GGTGACCCTT CTGTTGCAGA GCAAGCCAGT
AGACCAAGCG GAACTTTTAG AATTGACTTC CCCAATTCGC TGCAAAACCG TCAGATTGAA
TGCAATGTGG TCGACGCGAA AAACGCTTGG CATCCCGCTA CTTCCACCCT GTCGGATCAT
TCTAGACCAT TTGATATCGC CTTCCTTCAA CCACCTAGTC TAGAAATCCC AATCGATATA
AAGCCCATAG ACATCAAAGT CGATTGCACA TTTACCTCGG GACAGAACTT GAATGCGTTT
GGTGGTGCCG AAGAAGGCAA CTTAGTCGAC GACGAGTTCG GAGAATGGGT GCGCGGTAAA
CAAGTCCGGC GAATGCACGG ACGTTTTCAG TTTAAGTCCA ATCCAAGTGA GACACTAAAC
GTCGTTAGAG AAGGCTTCTC TGGCACTCCA ATCATCACAG AAAAAAATGA TGACGTCATT
GGAATGGTAA CTGTTGCGGA CGAAGGCAAG GGCTTAGCAT GGTTTCTTCC AGTTCAGACG
CTCTTCGAAG CCATTAACGG CGGTCCCGTC GAAGAACCAG AGGAGATAAC CGTTGATCCT
CGCACCTTTT CCCTTGATGA CGTTCCTATC GATCTGCACC CACCTTTCGT AGAGACCTAC
TTCAAAAGAA AATCTGTTCA AGAAACGATC GAGGCCAAAT TCGATAATTT TCTATCGATA
GGAATTTACG GCGGTACAGA TTCCGGCAAG ACTGGCGAAA CATCCATTTT CGTGCAGCGT
TTGGTGGAAT CTGAGCAACG CCCATGCGTT TGGTTCGTTT GTTCTGAAAC TAGTTCGATT
ACAGAACTTA TTGAAGTTGT ATCTGAAGTT CTGGGCGAAC ACTGCGGCGA GAAGCCGACC
TTTAGGCACT TGCTCCGCGC GCTGCACCAA ACGAACTTGT TGTTGATAGT TGACCAGATC
AACGCGGTTT CTGAAAGGAA CCGACAAGGC TTCTTTGGTC TTCTCGATCT GTGCAGGAAT
AGAAAAAGCA CCACTTCTCT TATTTTCATC CTCTCAGATG ACCACTCTGC TTACAAGGAT
GACGCTTCCA CAGTTTGTCT GAGTGCAATG TCACGAGAAG AAGTGGTCGA GATCATTTCA
CAAGAGGCCG GTCTCCCAGT GGAGGACGTT CAAGATGCTC TTTATGCCGA CAGATACCGT
TGTTCGGAGG TCTCGCTTCT TCTTCAAAGA ATCGTTGAAG ACCCCAGCAG GATTCACGAA
ATCGCTAAAG AGAATGTTGC GCTTAACGAC ACTATCTCAT CAGAATTCAA ACCACACGAG
CGCGCTGTAC TTACTGCATT GGCAATGACT GATCAGTATG TCGACAATGA CTTTTTCCGA
AGAGTCTGCG AAACGATAGG TGTACAAGAT TGGCAAGACC TTCAAGACCG GTTGTTCGGC
TTCGGACTGC TTCGCACTGC GGGCACGCGT AGGATTTTTG TAACCAATTT TGTCCGGGAA
AGAATTCGTT CTTCAACTCC CACGGTGAGT CAAAAACAAG CCGATAAGAT TCTTGGTACG
GTTTGGTTGG AGAAACACAA AGAACAAGAT CAAGGCGAAA CGCCCAAATT CGACATTTCT
AGCGACGACA TGATCTACTT GGCAATTCGT CACTTTCAGC GCAGCATGAG CGCGGAGCAG
ATGGTGGAAT CGCTAATAAG CGAGCACCGT TGGAATTTGG CTAGAGCTGG CCACCATCGA
TATCTCGTTG CGATCCTTGA TTTCGAGTTC AAGACAAAGC GGCGAATGCC AATATGGTCC
CAGTATCAGT ACGTTCAAAG CCTTGTCGCA ACAGGCGAAA TAACTTCTGC TTTCTCAGCC
ATGAAGCGAA TTTGTTCGAT TTTGCTTACA AGTCGCAATC TGAAGCCAGA CTCGCTGATG
CAGTTATGCA CAAGATTTTC TGACGTTCTT CAGAACGCTC GGCACTACGA TTTGGCTCAA
GAACTACTGG AAACTGCACT ACAGGAAATC GACAACTCCC AGCTGCAAGC TTCAACGCTT
CGGATCGCAA CATCCCATCT CTTATGGAGT AAGATATATT GTGAGCCTAA AATTGGAATT
GCGTACGAAC TGATATCATT GCGCGAAGAA GCACTGCTGG CGCGAAATGA TCTCGCGGTT
GCGATTGAGT CCACAAGACT GGGTGTTCTT TACCTAAATC TAAAAATGGC TGATAAGGCT
GCTACCGAAC TCCTGCAGGC CGTAAGGTTC TTCCAACATA CAGACCAGCG AGGCCTTATT
TGGTCACTTG GCCACTATTC AGTGGCTCTA GTGCGAAGTG ATCCACTGAA CGTACCCCTT
GATGAGTTAG AATGGCTTTG TGAATGCATA CAGAAATTCG ATCTTCTTAG CGAAGAGACT
TACGAGTATT TTCGTCTGTT TGAGGAGCTA CTTCACGGTG AAGAGCTTCA AGAACGAGTT
GCCTCCGTAA AAGAAACCGC TCTAGAAGCT TCGAACAACA GAGAACTGAG AGAAGAAGAT
TTAGAATTCG TTTTGGTCTT GCAAGAGTTT CTGAAAGAGA ATGGCTTGGA CGAACAATCG
AAGAAACGAC AAGGCACGGC TGATACAGAT AACGTGCGAG GCAAAACTTC GTTTCTTCAC
TCGGCTCAAT TCAAAATGGA TTCGCAAGCA AATAAGTCGT TTGTAAGGAA CCTTGTAAGT
CGTGATCCGG AGGGTGTCGC TTCAGATCTG TTCGATCGCT ACTCCCTCCA ACGCATCTTT
CGCACACCAA TCCTATCTAG TGTGTTGGTC GCCTGCTGCA AAAGGTCTCG CAACGAAGAG
CTAGTCGACT CCTACGTCGT ATCCAACTTA GGCGTGATAT TGTCGTCCAG AGACGACGTT
AAACTCTTCT TTGCGAGGTT CTTGGAGCAG GTAAAGAAAG ATGCGGAATG CGAAGAATTG
CTCGACTCAG TTGGAAAAAA GTCAGGCTTC AACTTCTATA ACGTTTCTGC AAACCTGTAC
TCCAGAAAGA GTTTTGAAAC TGCGCTCGAC TACAACCAAA AAGCACTCAC AGCATCCGCA
AAAGCCTCTC AAAGGGCGAG AGTCAATAAT AACATAGCGG TCTTGATACT TGAGAATGGC
AAATCGGACT TACTTGCAAA GGCTAAGCTC CATATTCAAG AATCCCTCAA AGAGAAATAC
CGTGGATACA ATTGGCCCAA TCGAACCAAA CTTGCAATTG ATGTAAATTG TGCCGAAAGC
CAAGAGCTTA ACCAAATAGT GGCGGGTTAT TTCGAACGGT CCGGCGACGA CCAAAGGACT
GTAAAATACA CTTCGCGCCT AATCATTGAT CCTGACCGGC GTATTGAGTT TCTCTCAGCC
TGCGACGCAA TCCTAGACTA G
 
Protein sequence
MHKPSVESSI VRIIRISDQQ WVGCGFYAKF GDLSVPPLVV TCAHVVADAI GDPSVAEQAS 
RPSGTFRIDF PNSLQNRQIE CNVVDAKNAW HPATSTLSDH SRPFDIAFLQ PPSLEIPIDI
KPIDIKVDCT FTSGQNLNAF GGAEEGNLVD DEFGEWVRGK QVRRMHGRFQ FKSNPSETLN
VVREGFSGTP IITEKNDDVI GMVTVADEGK GLAWFLPVQT LFEAINGGPV EEPEEITVDP
RTFSLDDVPI DLHPPFVETY FKRKSVQETI EAKFDNFLSI GIYGGTDSGK TGETSIFVQR
LVESEQRPCV WFVCSETSSI TELIEVVSEV LGEHCGEKPT FRHLLRALHQ TNLLLIVDQI
NAVSERNRQG FFGLLDLCRN RKSTTSLIFI LSDDHSAYKD DASTVCLSAM SREEVVEIIS
QEAGLPVEDV QDALYADRYR CSEVSLLLQR IVEDPSRIHE IAKENVALND TISSEFKPHE
RAVLTALAMT DQYVDNDFFR RVCETIGVQD WQDLQDRLFG FGLLRTAGTR RIFVTNFVRE
RIRSSTPTVS QKQADKILGT VWLEKHKEQD QGETPKFDIS SDDMIYLAIR HFQRSMSAEQ
MVESLISEHR WNLARAGHHR YLVAILDFEF KTKRRMPIWS QYQYVQSLVA TGEITSAFSA
MKRICSILLT SRNLKPDSLM QLCTRFSDVL QNARHYDLAQ ELLETALQEI DNSQLQASTL
RIATSHLLWS KIYCEPKIGI AYELISLREE ALLARNDLAV AIESTRLGVL YLNLKMADKA
ATELLQAVRF FQHTDQRGLI WSLGHYSVAL VRSDPLNVPL DELEWLCECI QKFDLLSEET
YEYFRLFEEL LHGEELQERV ASVKETALEA SNNRELREED LEFVLVLQEF LKENGLDEQS
KKRQGTADTD NVRGKTSFLH SAQFKMDSQA NKSFVRNLVS RDPEGVASDL FDRYSLQRIF
RTPILSSVLV ACCKRSRNEE LVDSYVVSNL GVILSSRDDV KLFFARFLEQ VKKDAECEEL
LDSVGKKSGF NFYNVSANLY SRKSFETALD YNQKALTASA KASQRARVNN NIAVLILENG
KSDLLAKAKL HIQESLKEKY RGYNWPNRTK LAIDVNCAES QELNQIVAGY FERSGDDQRT
VKYTSRLIID PDRRIEFLSA CDAILD