Gene Information |
Locus tag | Jann_3804 |
Symbol | |
ID | 3936284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 3889368 |
End bp | 3890873 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637906182 |
Product | extracellular solute-binding protein |
Protein accession | YP_511746 |
Protein GI | 89056295 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
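The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 3889368
END_BP = 3890873
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the reported 1506 bp gene length and 501 aa protein length.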
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000726853 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000406829 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCTCGA TCAAAACAAC CGCCCGCGCG CTGGCACTCA TAAGCACCGC CCTCGCGGCC CCATTGTCCG CGCAAACGCT TGATCTTGCA TGGTCCCAGG ACGCCACCGG CCTTGATCCG CACACGCAGC CCGGCTTCGC GACGATCCGC CTGCTGGAAT TGATGTATGA GCCGCTTTTG CGTCTGGATG CCAACCTGGA GCTTCAGCCC GCCATCGCGC AAAGCTGGTC CTTCTCCGAC GATGGTCTGC AACTGACATT CCAACTGGAC CCCGCGGCGA TGTTCCACGA CGGCACATCC GTGACCTCTG CCGATGTCCG CGCCTCGTTC GAGCGTATTC TGGACGAGGA AACCGGCGCG ATTTCGCGGG CGAACTACAC GTCCATCATC AATATCGAAA CACCCGATGA TGCCACCGTG GTGTTTGAAC TGGATCGCCC CGATGCGCCG ATCCTCAACG GTCTGGCCAC GGTGAACGCC GCCGTCCTGC CTGCGTCGGC GATTGAGGCA GGCACGATCG CCACGGAAGT CGTGGGCTCC GGCCCGTTCA TGTTGGACGC CCGCACGCCC AACGCCAGCG CCACCCTGAC GAGTTTCGCG GATTGGCATG GCGGCGACGT GGCCTACGAC ACCCTGTCCA TCAGCGTGCT GCCCGATGAA ACCGCGCTTC TGGGTGCCTT GCGTGCCGGT CAGGCCGATT TTGCATTGAT CAACGATCCG TTGGTCGCGA CGCTGGTGCC CTCTACGGAT GGATTGACGC TGAACACCGC GCCCACGCTC AGCTACTATG TGCTGCAACT CAACGCAGCC CGGGAGCCGA TGGACTCGCT GCCCCTGCGG CAGGCGATCA GCTGCGCGAT CAACCGCCAG GACATTCTGG ATGCCGCCCT TCTGGGTGAG GGGGAGGTGA CGGGTCCCCT GACATCCCCC GCCTATCGCA CCGACCCAAG CAGCCTGTTC TGCTATGAGC AGGATCAGGA TCGCGCCCGC GCGTTGCTGG CAGAGGCGGG CTTTGCCGAT GGCTTCACGG CCACCGTCAT GGCCGCAACC GGTGAGCCGC CCACCGCATC GGCCGTGGCG CAGGTGATCC AGTCGCAACT GTCTGAGGTC GGCATCACGC TTGAGATCGA GATGCAGGAG CTGAGCGTCT ATATCGACCG CTGGCTTGCC GCGGATTTCG ACATGGCCGT GGCGCTGAAC GGCGGGCGCG TGGACCCCTA TACGATGTAC AACCGCTACT GGACCCGCGA CGGGAACCTG CAAGGCGTCG CCAACTACAT CGATGATACA CTCGACACGC TGATGAACGA TGGCCGGGCC GAGACGGGCG AAGAGGCCCG CCGGGAGATC TATGCCAACT TCGAGTCCCA TCTGGCCGAA ATGTCGCCCT GGGTCTGGCT GTTCACTGGC AACACCTACA CGGCTCAGAC CGACGCGGTC TCCGGATTCG TTCCCACGCC CAACGGATCG CTCTTCGGCC TCGTGGATGT GACCCTGGCT GAATAA
|
Protein sequence | MTSIKTTARA LALISTALAA PLSAQTLDLA WSQDATGLDP HTQPGFATIR LLELMYEPLL RLDANLELQP AIAQSWSFSD DGLQLTFQLD PAAMFHDGTS VTSADVRASF ERILDEETGA ISRANYTSII NIETPDDATV VFELDRPDAP ILNGLATVNA AVLPASAIEA GTIATEVVGS GPFMLDARTP NASATLTSFA DWHGGDVAYD TLSISVLPDE TALLGALRAG QADFALINDP LVATLVPSTD GLTLNTAPTL SYYVLQLNAA REPMDSLPLR QAISCAINRQ DILDAALLGE GEVTGPLTSP AYRTDPSSLF CYEQDQDRAR ALLAEAGFAD GFTATVMAAT GEPPTASAVA QVIQSQLSEV GITLEIEMQE LSVYIDRWLA ADFDMAVALN GGRVDPYTMY NRYWTRDGNL QGVANYIDDT LDTLMNDGRA ETGEEARREI YANFESHLAE MSPWVWLFTG NTYTAQTDAV SGFVPTPNGS LFGLVDVTLA E
|
| |