Gene Information |
Locus tag | Jann_2004 |
Symbol | |
ID | 3934457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 2007963 |
End bp | 2011463 |
Gene Length | 3501 bp |
Protein Length | 1166 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637904360 |
Product | hypothetical protein |
Protein accession | YP_509946 |
Protein GI | 89054495 |
COG category | |
COG ID | |
TIGRFAM ID | |
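The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2007963
end_bp = 2011463

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 3501 1166 0
```

Under these assumptions the computed values match the listed gene length (3501 bp) and protein length (1166 aa), and the zero remainder confirms the gene length is a whole number of codons.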
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.731586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAC CGTCAGTAGA GAGCTCTATT GTCCGAATTA TTCGCATCAG TGACCAGCAG TGGGTAGGCT GCGGATTCTA TGCGAAATTT GGCGACCTGA GCGTGCCGCC CCTAGTTGTT ACTTGCGCAC ACGTAGTTGC CGATGCGATC GGTGACCCTT CTGTTGCAGA GCAAGCCAGT AGACCAAGCG GAACTTTTAG AATTGACTTC CCCAATTCGC TGCAAAACCG TCAGATTGAA TGCAATGTGG TCGACGCGAA AAACGCTTGG CATCCCGCTA CTTCCACCCT GTCGGATCAT TCTAGACCAT TTGATATCGC CTTCCTTCAA CCACCTAGTC TAGAAATCCC AATCGATATA AAGCCCATAG ACATCAAAGT CGATTGCACA TTTACCTCGG GACAGAACTT GAATGCGTTT GGTGGTGCCG AAGAAGGCAA CTTAGTCGAC GACGAGTTCG GAGAATGGGT GCGCGGTAAA CAAGTCCGGC GAATGCACGG ACGTTTTCAG TTTAAGTCCA ATCCAAGTGA GACACTAAAC GTCGTTAGAG AAGGCTTCTC TGGCACTCCA ATCATCACAG AAAAAAATGA TGACGTCATT GGAATGGTAA CTGTTGCGGA CGAAGGCAAG GGCTTAGCAT GGTTTCTTCC AGTTCAGACG CTCTTCGAAG CCATTAACGG CGGTCCCGTC GAAGAACCAG AGGAGATAAC CGTTGATCCT CGCACCTTTT CCCTTGATGA CGTTCCTATC GATCTGCACC CACCTTTCGT AGAGACCTAC TTCAAAAGAA AATCTGTTCA AGAAACGATC GAGGCCAAAT TCGATAATTT TCTATCGATA GGAATTTACG GCGGTACAGA TTCCGGCAAG ACTGGCGAAA CATCCATTTT CGTGCAGCGT TTGGTGGAAT CTGAGCAACG CCCATGCGTT TGGTTCGTTT GTTCTGAAAC TAGTTCGATT ACAGAACTTA TTGAAGTTGT ATCTGAAGTT CTGGGCGAAC ACTGCGGCGA GAAGCCGACC TTTAGGCACT TGCTCCGCGC GCTGCACCAA ACGAACTTGT TGTTGATAGT TGACCAGATC AACGCGGTTT CTGAAAGGAA CCGACAAGGC TTCTTTGGTC TTCTCGATCT GTGCAGGAAT AGAAAAAGCA CCACTTCTCT TATTTTCATC CTCTCAGATG ACCACTCTGC TTACAAGGAT GACGCTTCCA CAGTTTGTCT GAGTGCAATG TCACGAGAAG AAGTGGTCGA GATCATTTCA CAAGAGGCCG GTCTCCCAGT GGAGGACGTT CAAGATGCTC TTTATGCCGA CAGATACCGT TGTTCGGAGG TCTCGCTTCT TCTTCAAAGA ATCGTTGAAG ACCCCAGCAG GATTCACGAA ATCGCTAAAG AGAATGTTGC GCTTAACGAC ACTATCTCAT CAGAATTCAA ACCACACGAG CGCGCTGTAC TTACTGCATT GGCAATGACT GATCAGTATG TCGACAATGA CTTTTTCCGA AGAGTCTGCG AAACGATAGG TGTACAAGAT TGGCAAGACC TTCAAGACCG GTTGTTCGGC TTCGGACTGC TTCGCACTGC GGGCACGCGT AGGATTTTTG TAACCAATTT TGTCCGGGAA AGAATTCGTT CTTCAACTCC CACGGTGAGT CAAAAACAAG CCGATAAGAT TCTTGGTACG GTTTGGTTGG AGAAACACAA AGAACAAGAT CAAGGCGAAA CGCCCAAATT CGACATTTCT AGCGACGACA TGATCTACTT GGCAATTCGT CACTTTCAGC GCAGCATGAG CGCGGAGCAG 
ATGGTGGAAT CGCTAATAAG CGAGCACCGT TGGAATTTGG CTAGAGCTGG CCACCATCGA TATCTCGTTG CGATCCTTGA TTTCGAGTTC AAGACAAAGC GGCGAATGCC AATATGGTCC CAGTATCAGT ACGTTCAAAG CCTTGTCGCA ACAGGCGAAA TAACTTCTGC TTTCTCAGCC ATGAAGCGAA TTTGTTCGAT TTTGCTTACA AGTCGCAATC TGAAGCCAGA CTCGCTGATG CAGTTATGCA CAAGATTTTC TGACGTTCTT CAGAACGCTC GGCACTACGA TTTGGCTCAA GAACTACTGG AAACTGCACT ACAGGAAATC GACAACTCCC AGCTGCAAGC TTCAACGCTT CGGATCGCAA CATCCCATCT CTTATGGAGT AAGATATATT GTGAGCCTAA AATTGGAATT GCGTACGAAC TGATATCATT GCGCGAAGAA GCACTGCTGG CGCGAAATGA TCTCGCGGTT GCGATTGAGT CCACAAGACT GGGTGTTCTT TACCTAAATC TAAAAATGGC TGATAAGGCT GCTACCGAAC TCCTGCAGGC CGTAAGGTTC TTCCAACATA CAGACCAGCG AGGCCTTATT TGGTCACTTG GCCACTATTC AGTGGCTCTA GTGCGAAGTG ATCCACTGAA CGTACCCCTT GATGAGTTAG AATGGCTTTG TGAATGCATA CAGAAATTCG ATCTTCTTAG CGAAGAGACT TACGAGTATT TTCGTCTGTT TGAGGAGCTA CTTCACGGTG AAGAGCTTCA AGAACGAGTT GCCTCCGTAA AAGAAACCGC TCTAGAAGCT TCGAACAACA GAGAACTGAG AGAAGAAGAT TTAGAATTCG TTTTGGTCTT GCAAGAGTTT CTGAAAGAGA ATGGCTTGGA CGAACAATCG AAGAAACGAC AAGGCACGGC TGATACAGAT AACGTGCGAG GCAAAACTTC GTTTCTTCAC TCGGCTCAAT TCAAAATGGA TTCGCAAGCA AATAAGTCGT TTGTAAGGAA CCTTGTAAGT CGTGATCCGG AGGGTGTCGC TTCAGATCTG TTCGATCGCT ACTCCCTCCA ACGCATCTTT CGCACACCAA TCCTATCTAG TGTGTTGGTC GCCTGCTGCA AAAGGTCTCG CAACGAAGAG CTAGTCGACT CCTACGTCGT ATCCAACTTA GGCGTGATAT TGTCGTCCAG AGACGACGTT AAACTCTTCT TTGCGAGGTT CTTGGAGCAG GTAAAGAAAG ATGCGGAATG CGAAGAATTG CTCGACTCAG TTGGAAAAAA GTCAGGCTTC AACTTCTATA ACGTTTCTGC AAACCTGTAC TCCAGAAAGA GTTTTGAAAC TGCGCTCGAC TACAACCAAA AAGCACTCAC AGCATCCGCA AAAGCCTCTC AAAGGGCGAG AGTCAATAAT AACATAGCGG TCTTGATACT TGAGAATGGC AAATCGGACT TACTTGCAAA GGCTAAGCTC CATATTCAAG AATCCCTCAA AGAGAAATAC CGTGGATACA ATTGGCCCAA TCGAACCAAA CTTGCAATTG ATGTAAATTG TGCCGAAAGC CAAGAGCTTA ACCAAATAGT GGCGGGTTAT TTCGAACGGT CCGGCGACGA CCAAAGGACT GTAAAATACA CTTCGCGCCT AATCATTGAT CCTGACCGGC GTATTGAGTT TCTCTCAGCC TGCGACGCAA TCCTAGACTA G
|
Protein sequence | MHKPSVESSI VRIIRISDQQ WVGCGFYAKF GDLSVPPLVV TCAHVVADAI GDPSVAEQAS RPSGTFRIDF PNSLQNRQIE CNVVDAKNAW HPATSTLSDH SRPFDIAFLQ PPSLEIPIDI KPIDIKVDCT FTSGQNLNAF GGAEEGNLVD DEFGEWVRGK QVRRMHGRFQ FKSNPSETLN VVREGFSGTP IITEKNDDVI GMVTVADEGK GLAWFLPVQT LFEAINGGPV EEPEEITVDP RTFSLDDVPI DLHPPFVETY FKRKSVQETI EAKFDNFLSI GIYGGTDSGK TGETSIFVQR LVESEQRPCV WFVCSETSSI TELIEVVSEV LGEHCGEKPT FRHLLRALHQ TNLLLIVDQI NAVSERNRQG FFGLLDLCRN RKSTTSLIFI LSDDHSAYKD DASTVCLSAM SREEVVEIIS QEAGLPVEDV QDALYADRYR CSEVSLLLQR IVEDPSRIHE IAKENVALND TISSEFKPHE RAVLTALAMT DQYVDNDFFR RVCETIGVQD WQDLQDRLFG FGLLRTAGTR RIFVTNFVRE RIRSSTPTVS QKQADKILGT VWLEKHKEQD QGETPKFDIS SDDMIYLAIR HFQRSMSAEQ MVESLISEHR WNLARAGHHR YLVAILDFEF KTKRRMPIWS QYQYVQSLVA TGEITSAFSA MKRICSILLT SRNLKPDSLM QLCTRFSDVL QNARHYDLAQ ELLETALQEI DNSQLQASTL RIATSHLLWS KIYCEPKIGI AYELISLREE ALLARNDLAV AIESTRLGVL YLNLKMADKA ATELLQAVRF FQHTDQRGLI WSLGHYSVAL VRSDPLNVPL DELEWLCECI QKFDLLSEET YEYFRLFEEL LHGEELQERV ASVKETALEA SNNRELREED LEFVLVLQEF LKENGLDEQS KKRQGTADTD NVRGKTSFLH SAQFKMDSQA NKSFVRNLVS RDPEGVASDL FDRYSLQRIF RTPILSSVLV ACCKRSRNEE LVDSYVVSNL GVILSSRDDV KLFFARFLEQ VKKDAECEEL LDSVGKKSGF NFYNVSANLY SRKSFETALD YNQKALTASA KASQRARVNN NIAVLILENG KSDLLAKAKL HIQESLKEKY RGYNWPNRTK LAIDVNCAES QELNQIVAGY FERSGDDQRT VKYTSRLIID PDRRIEFLSA CDAILD
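The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the pipeline that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted), and it strips the spaces that separate the 10-base groups in the listing. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCATAAAC CGTCAGTAGA GAGCTCTATT"
print(translate(prefix))  # MHKPSVESSI
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, although how the source rounds that figure is not stated.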