Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1077 |
Symbol | |
ID | 3933521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 1038768 |
End bp | 1040336 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637903425 |
Product | extracellular solute-binding protein |
Protein accession | YP_509019 |
Protein GI | 89053568 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.108497 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTAC GCAACCTGAT GGCCATCACG CTTCTTGCCG GCTCGCTGGC GCTGCCTGCC GCAGCCCAGG AGCGCCCGGA TCTGGTCATC GCCGTCGACA ACCTCTGGCC AACGATGGAT CCGGTTATTG GCATTTCAAC GACCGGTGCC CGCGTGCACA CGAACATCTT TGATACCCTG GTGCGTCGGA ACCGCTGGGA AGATCCCGAT GGCACGGAAC TCGTGCCCTG GCTTGCCGAA AGCTGGCAGC AGCTGTCGCC GAATGTCTGG GAGTTCACGC TGCGCGACGG CGTCCCCTTC CACGACGGCC ACATCATGGA TGCCGAAGAT GTCGCGTTCT CGTTGTCGGC AGAACGGCTC TGGGGCGATG AACCCCTTGC CCCGCGCGGC ACCCGCTACA CGCGCGGCTT TGTGCGGGTG GAGGCGACCG GCCCGCTGAC CGTCGAAGTG GAAACCAGCT TCCCCGATCC GAACCTGCCG TTCCGCATGG TCACGCCCAT CGGCTTTGTC CTGCCGCAGC ATTATTATGA GGAGGTCGGA ACAGAAGCCT TCGGGCAGAT GCCGATGGGC ACCGGCCCGT ACCGGATGAC CGCGTTTGAC CCCTCCGTCG CCGTAGAGGC GGTGGCCTTT GACGATTACT GGGCCGGAGA GCCGCCCGCC GCCACGCTGC GCTTCGAGAT CGTGCCCGAG TTCTCCACCC GGTTCGCGGG CCTTGTGGCG GGCGAATATG ACGTCGTCGT CAGTGTGCCG TCCGATCAGG TCGATGTGCT CGACGGCACC GACGGCGTGA CGGTGCTGGA GAAGGGGATC GAGAATTATC CGATGTTCGC CTTCAACATG CTTGAGACCG ACGCGCTGCC CGACAACCCG CTGACGGATG TGAACCTGCG CCGCGCCATT GTGGCCGGTG TGGACCGCGA CGCGATTGCA GCGGCGATCT GGGGCGACGC AACCTTCGTG CCGACACCCT TCAACTTCCC GGAATATGGC GATTATTTCG ATCCGGACCG CGAACGCGCT GTCCCGTTTG ATCCCGCACA GGCCGCCGAA TACCTGGCCG CCAGCGACTA TGACGGCGAA GAGCTGATCT GGCACATCAC CCGGGGCTTC TACCCGAATT ACGAGATCGC GGCGGAGTTC ATGGTGGAGC AATGGGCCGA GATCGGGATC AATGTGCGCA TCGAGTTGAA GGACAATTTC TCACTCGCCT ATGAACGCCC GTTCCACTTC CTCAACATGT CGATGAGCTC CGAGTTTTCC GGCGATCCCT ATCGCCCGCT CTGGATGGAT TGGGGCCCTC TGTCGAGCCG GTTCCGCGCC TCGCACCGGA CCTGGGACAT GACGGAAGAG TTTGTGACCC TTGGCGAAGC GTTTGAACAG ACACAGGATT TTGAGGGACG CTATCAGGCC TATCTGGATC TGGTCGCGGA ATGGGAACGC GTCACGCCCG GCATGTATAT GTGGCGCAAC GTGGTCAGCT ACGCGATCAA CGAAGACCTG CAATGGGATC CCGGCAATTC CGCCGTCACC ATCTTCGATC ACCTCTATAT GACTGGCTAT GATGGGTGA
|
Protein sequence | MTLRNLMAIT LLAGSLALPA AAQERPDLVI AVDNLWPTMD PVIGISTTGA RVHTNIFDTL VRRNRWEDPD GTELVPWLAE SWQQLSPNVW EFTLRDGVPF HDGHIMDAED VAFSLSAERL WGDEPLAPRG TRYTRGFVRV EATGPLTVEV ETSFPDPNLP FRMVTPIGFV LPQHYYEEVG TEAFGQMPMG TGPYRMTAFD PSVAVEAVAF DDYWAGEPPA ATLRFEIVPE FSTRFAGLVA GEYDVVVSVP SDQVDVLDGT DGVTVLEKGI ENYPMFAFNM LETDALPDNP LTDVNLRRAI VAGVDRDAIA AAIWGDATFV PTPFNFPEYG DYFDPDRERA VPFDPAQAAE YLAASDYDGE ELIWHITRGF YPNYEIAAEF MVEQWAEIGI NVRIELKDNF SLAYERPFHF LNMSMSSEFS GDPYRPLWMD WGPLSSRFRA SHRTWDMTEE FVTLGEAFEQ TQDFEGRYQA YLDLVAEWER VTPGMYMWRN VVSYAINEDL QWDPGNSAVT IFDHLYMTGY DG
|
| |