Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_2047 |
Symbol | |
ID | 3934500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 2051067 |
End bp | 2052656 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637904403 |
Product | extracellular solute-binding protein |
Protein accession | YP_509989 |
Protein GI | 89054538 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.101703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.665186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGT TCAAACCGGC CCTGCTGTCG GCGATGCTGG CGCTGCCGCT GGCAGCCCCC GCGGCCCTTG CGGAAACGCC GCCCGAGATC CTGGTGGTCG CGCAGAATAT CGACGACATC GTCGCGATTG ACCCGGCTCA GGCCTATGAG TTCACCTCTG GCGAGTTGGT GACGAACACC TATGACCGTC TTGTGCAATA CGATGCCGAA GACACGACCG TGCTGGCCCC CGGTCTTGCC ACGGAGTGGG AGATTGACGC CGAGGCCAAG ACCATCGTCT TCACCATGCG CGAGGGTGTG ACGTTCCATT CCGGCAATGC GTTCACGGCG GATGACGTGG TTGGCTCCTT CGCCCGCGTG GTGCAGTTGA ACCTGACGCC TGCGTTCATC CTGACGCAGC TTGGCTGGAC GCCCGAAAAC GTCGAGGAGA TGGTGACAGC GGACGGCAAC ACTGTGACCG TGCGCTATGA TGGCGACTTT TCCCCGGCCT TCGTGATGAA CGTGCTGGCG TCGCGGCCTG CCTCCATCGT GGATATGGAA ACGGTCATGG CCAACGCGGT TGACGGCGAT ATGGGCAATG CGTGGCTCAA CGCCAACACG GCGGGCACGG GGCCGTTTTC GCTCAACACC TTCCGCGCGG CAGAGCTGAT CCGCATGGAT GCCAACCCTG ACTATTTCAA CGGTGCCCCC GCCATCGAGG GTGTAATCAT CCGCCATGTC GCGGAATCGG CGACACAGCA ATTGTTGCTG GAAGCGGGTG ATGTTGATAT TGCGCGCAAC TTGACGCCGG ACCAGATCGC GTCACTTGGC GGGGACGAGT TGCAGGTGGA GACGTTCCCA CAAGCAGCCG TCCACTTCCT GTCGTTCAAC CAGGCGGTCG AAAGCCTAAC GCCCCCCGCC GTATGGGAAG CCGCGCGCTA TCTGGTGGAT TACGAGGGGA TGACCAACTC GATCATCGCA GGCCAGATGG AAATCCATCA GGCGTTCTGG CCAGAAGGGT TCCCCGGCGC GTTGACCGAC ACGCCCTACA CCTATGATCC GGAGCGCGCC GCGCAGATCC TGGAAGATGC AGGGATTGAG CTGCCGATCA CCGTCACGCT CGATGTGATC AACGCAGCAC CCTTCACCGA TATGGCGCAA TCGTTGCAGG CGTCTTTCGC CGAAGCGGGC ATCGAGTTCG AAATCCTGCC CGGCACCGGA TCACAGGTGA TCACCCGCTA CCGCGACCGC AGCCATGAGG CGATGTTGCT CTACTGGGGC CCGGACTTCA TGGATCCCCA TTCCAACGCC AAAGCCTTCG CCTACAATTC CGACAATCGG CAGGAAACCT ACACCGCCAC GACGACATGG CGGAACTCCT GGGCGGTGCC GGAAGAGATG AACGCGATGA CGACGGCGGC CCTGACGGAA TCCGATCCGG CTGTGCGTGA AGAGATGTAT CTGGAGCTTC AGCGGCAGGT GCAGGCGAAC TCGCCCATCG TGATCATGTT CCAGGCCTCC TATCAGGTGG GTATGGCCGA GAATGTGTCA GGCTACGTGA ATGGTGCGAC GTCTGACTTC GTGTTCTACC GGCTTGTCGA CAAAAGCTGA
|
Protein sequence | MKLFKPALLS AMLALPLAAP AALAETPPEI LVVAQNIDDI VAIDPAQAYE FTSGELVTNT YDRLVQYDAE DTTVLAPGLA TEWEIDAEAK TIVFTMREGV TFHSGNAFTA DDVVGSFARV VQLNLTPAFI LTQLGWTPEN VEEMVTADGN TVTVRYDGDF SPAFVMNVLA SRPASIVDME TVMANAVDGD MGNAWLNANT AGTGPFSLNT FRAAELIRMD ANPDYFNGAP AIEGVIIRHV AESATQQLLL EAGDVDIARN LTPDQIASLG GDELQVETFP QAAVHFLSFN QAVESLTPPA VWEAARYLVD YEGMTNSIIA GQMEIHQAFW PEGFPGALTD TPYTYDPERA AQILEDAGIE LPITVTLDVI NAAPFTDMAQ SLQASFAEAG IEFEILPGTG SQVITRYRDR SHEAMLLYWG PDFMDPHSNA KAFAYNSDNR QETYTATTTW RNSWAVPEEM NAMTTAALTE SDPAVREEMY LELQRQVQAN SPIVIMFQAS YQVGMAENVS GYVNGATSDF VFYRLVDKS
|
| |