Gene Information |
Locus tag | Jann_3050 |
Symbol | |
ID | 3935521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 3081031 |
End bp | 3082650 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637905421 |
Product | extracellular solute-binding protein |
Protein accession | YP_510992 |
Protein GI | 89055541 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
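Note on the coordinate fields: the start, end, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its convention). Under that reading, 3082650 - 3081031 + 1 = 1620 bp, and 1620 / 3 = 540 codons, of which the last is a stop codon, leaving 539 residues. A minimal Python sketch of this check follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(3081031, 3082650)
print(length, protein_length(length))  # prints: 1620 539
```

The check applies only because the record marks the gene as unspliced and not a pseudogene; a spliced or frameshifted feature would need its exon structure taken into account.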
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.423075 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGTC ACGCCCTACT TGCTGCCACA AGCGCGCTTG TACTCGCCCT GCCCGCCTAT GCGCAGGAAA CGCACCCTGA AACTGGTGAA GCGCTGGCTG CCAATCAGGA TTTTTCCTAT CGCCTGCTGG ATCAGTTTCC ATCGCTCGAC CCGCAGCTGA TTGAGGAAAC CGCGGGCGGA CACGTCGGGC GGCAGTTGTT CGAAGGGCTC TTGACACAAA ACGCGGACGG GTCCCTGCGC CCGGGTGTGG CAACAGAATG GTCAAGCGAC GACAACCAGA CATGGACTTT TACGCTGCGC GACGATGCGC GCTGGTCAAA CGGCGATCCG GTCACAGCCA ATGACTTCGT TTATGCTTGG CGTCGCGCCG CTGATCCGGT GACCGCATCC GAGTACTCTT GGTACGTCGA GCTGACACAG ATGACCAACG CGGCCGAAAT CATCGCCGGT GAAATGCCGA CGGAGGAATT GGGCGTGCGC GCCATCGACG ATCACACGCT GGAAGTCACG CTGAATGCGC CCCTGCCCTA CTTCCCGCAG ATGGCCGTGC ACTATACCTT GATGCCGACG CACCAGCCCA CGATTGAAGC CCATGGATCT GACTGGACCC AACCTGAGAA TATCGTCAGC AATGGCGCTT ACATCCTGAC CGAGATCGCA ATCAACGAGT ATTTCCGGCT GGAACGTAAC CCCGAATATT GGGGTGCCGA TGACGTGATC ATTGACAGCG TTACGGGCTA TGTCATCAAC GACGCCAATC AGGCGCTCAG CCGCTTTCAG GCCGGTGAGT TTGACATGAT GGACGACCTT CCGGCGGGAT CATATCCCGA TCTGGAAGCC GAGATGCCAG ATACGGTCCA TGCGACGCCG CGCCTGTGCA CCTATTACTA CCTGATCAAC CAGTCCGAAA GCGGGGCCGA GGCATTGCAG GACGTTCGCG TGCGCACGGC TTTGAGCTAT GCCATCCGGC GTGAGGTGAT CACCGACCAG ATCCTTCAAG CGGGTCAGCG CCCCGCCTAC AGCTTCACCC ATTGGGCAAC AGCCGATTTC GAAATGCCCG ACATTGCCTA CGCCAACATG ACCCAGGACG CACGCATGGA AGAAGCCATG CGCCTGATGA CCGAGGCTGG TTACGGGCCG GACAACCCAC TGGAACTGGA CCTTATCTAC AACACGTCAG AGAACCACCG GCAGATCGCA ATCGCCGCCT CACAGATGTG GGCACCGCTG GGGGTTGAGA TCTCGCTGTC GAACTACGAA TGGCAGAGCT ACCTTGATGT CCGCGGCAAC CAAAACTTCG ATCTCGGCCG CGCGGCATGG TGTGGCGACT ACAACGAGGC GTCGACATTT CTTGATCTGC TGACATCGAA CAATGACAAT AACGATGGCA AGTTCGTGAA CGCCGACTAT GACGCCCTGA TGGCAGAAGC GGCGGTGACG GCCGATCCGA GCGCTCTCTA CGAGCAAGCC GAACAGATCC TTGCTGACCA AATGGCACTG ATCCCGATCT ACCACTATTC CCAGAACTTC GTGCTGGACC CAACCATCCA CAACTGGCCG ATGGAGAACG TGGAAAACAA TTGGTACGTG CGCGATCTCT ACCGCGTCGC GTCCGAGTGA
|
Protein sequence | MTRHALLAAT SALVLALPAY AQETHPETGE ALAANQDFSY RLLDQFPSLD PQLIEETAGG HVGRQLFEGL LTQNADGSLR PGVATEWSSD DNQTWTFTLR DDARWSNGDP VTANDFVYAW RRAADPVTAS EYSWYVELTQ MTNAAEIIAG EMPTEELGVR AIDDHTLEVT LNAPLPYFPQ MAVHYTLMPT HQPTIEAHGS DWTQPENIVS NGAYILTEIA INEYFRLERN PEYWGADDVI IDSVTGYVIN DANQALSRFQ AGEFDMMDDL PAGSYPDLEA EMPDTVHATP RLCTYYYLIN QSESGAEALQ DVRVRTALSY AIRREVITDQ ILQAGQRPAY SFTHWATADF EMPDIAYANM TQDARMEEAM RLMTEAGYGP DNPLELDLIY NTSENHRQIA IAASQMWAPL GVEISLSNYE WQSYLDVRGN QNFDLGRAAW CGDYNEASTF LDLLTSNNDN NDGKFVNADY DALMAEAAVT ADPSALYEQA EQILADQMAL IPIYHYSQNF VLDPTIHNWP MENVENNWYV RDLYRVASE
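Note on the sequence layout: both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The Python sketch below shows one way to do that and to run two basic checks (GC percentage and the presence of a terminal stop codon). The helper names are hypothetical, and the demonstration string contains only the first and last blocks of the gene sequence, not the full gene.

```python
# Stop codons under translation table 11 (the table listed for this gene).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last ten-base blocks of the gene sequence only (illustration).
example = clean_sequence("ATGACACGTC GTCCGAGTGA")
print(example.startswith("ATG"), ends_with_stop(example))  # prints: True True
```

Passing the complete gene sequence through `clean_sequence` and `gc_content` would give a value to compare with the GC content field of the record, and the length of the cleaned string can be compared with the Gene Length field.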