Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0585 |
Symbol | |
ID | 6374249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 613282 |
End bp | 615168 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642683098 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001959025 |
Protein GI | 189499555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.952367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0792841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGTA AAGGAGAGCC AGACAAACAG CAGGCCACGC AGCCTTGCTT TACCGGCTGT TTGTCTGAAT CCTCAACTGC TCCGGCGTGT CCTGAAAAGT CCGGACTGTC TGGTGTTTTT TTACATGATC ATATCGGCAT CAACAGCGTG CTGATTGTTT GTATGCTGAT AGTCGTGACG CTTTCAGGCT GTTCATCTGA CCGCTTTTCA TCCGGCGGAT CAGCAGGTGA TTCAACGCTG GTGATCGCCA TGCTTGGTGA CGCCAACTAT CTCAATCCTG TGATCGGCGC TTCGGTAACA TCCAGCAATG TCTACGGTCT TCTCTATCCG GGTCTTCTTG AAAGCGAGTT CGACACGACG TCAGGCCTCT TGAATTTTGT CGCTCTTGAA AAAAAGTTGA GGGAATCGAC AGGGGAATCC ATCAGGAAAA AGCCGGGCGG AGCGTTGGCA AAAACATGGA AAATGGGAGA CGATTACCGT TCCATCACCT ATATACTCAG AAACGACGCG AAATGGAATG ACGGCACACC AATTACAGCG CATGATTTTA AATTCACCTA TGAGCTCTAT GGCAATCCTC TTATCGCTAG CCCCCGGCAA CAGTATCTTG CTGAGCTTGT GGGAGCTGAT ACAGGTGAAA TTGATTTCGA GAAGGCAATT GAAGCACCTG ATGACACAAC ACTCGTGTTC AACTTCTATA AAGCGGTTCC TGAACAGCTT GCGCTGTTTC ATACCTCTCT GACGCCACTT CCGAAACACA AATGGGAACA TGTGGCACTC GAAGAGTTCC GGCATTCTCC CCTCAATCAG AAACCGCTGG GCGCAGGTCC TTATGTTCTT CAGGAGTGGC TGAAACAGCA GCAGATTGTT CTTGCTTCAA ATCCTTCGTG TACGCTGCCT AAACCTGGTG ATATCGCGCG TATCATGTTC AGGATAGTGC CTGACTACAC GGTACGTCTG GCGCAGCTAC AGACTGGAGC TGTCGATGTT GTGGAAAACA TAAAACCGGA AGACTTTGCC GGCCTTGAGC GCGCCAGAGC CGGTGTGGAG ATCAAGTCAG TCGGACTGCG TGTCTATGAC TATATCGGAT GGTCGAATAT CGACCAGGTG TCTTATGAGC GTGACGGTAC TATCAGGCCG CATCCTCTGT TCGGTTCAAA GAATGTTCGC CGGGCACTGA CGCTTGCTAT TGACCGGCAA TCAATTCTCG ACGGTTATCT CGGAGAGTAC GGTGAGGTAG CCAGCACCGA TATATCTCCT TCGCTCAAAT GGGCATACAA TGATTCTGTT ACACCCTACC CCTATGATCC GTCAGAGGCG GTCAGGATTC TTGAAGAAGA GGGATGGTTT CCAGGCCCGG ATGGTATTCG AGAGAAAAAT GGCAGAAAGT TCAGCTTTGT GCTGTATACC AATGCAGGCA ACGCCCGCAG GAATTTCGCC AGTGTTATTA TTCAACAGAA TCTCAGGGAG ATTGGAATTG ACTGTCAACT GGATGTTCAG GAATCAAATG TGTTTTTTGA AAACCTGCGG CTTCGTAAAA TTGAAGCATG GATGGCAGGA TGGTCGATAG GGCTGGAAAT AGATCCTCTT GACGGATGGG GTTCAGATCT TGAAAAAAGC CGTTTTAATT TTACCGGTTA TCAGAATTCG AGAATCGATA CCCTTTGCGA ACTGGCAAAA GGTCAAATGA ATCCACTGGA TGCAAGACCG TACTGGATTG AGTATCAGGA AATTCTGCAC CGCGATCAGC CGACGACATT TTTATACTGG ATTAAGGAAA CGCAGGGATT TAACCGCAGG ATCGAAGGTG AGGAGCTGAA TATCCTCAGT ACCTTCTACA ACATTGACGA CTGGATCCTT TCCCCGTCAG CTGGTGTTGC GGAGTAG
|
Protein sequence | MNSKGEPDKQ QATQPCFTGC LSESSTAPAC PEKSGLSGVF LHDHIGINSV LIVCMLIVVT LSGCSSDRFS SGGSAGDSTL VIAMLGDANY LNPVIGASVT SSNVYGLLYP GLLESEFDTT SGLLNFVALE KKLRESTGES IRKKPGGALA KTWKMGDDYR SITYILRNDA KWNDGTPITA HDFKFTYELY GNPLIASPRQ QYLAELVGAD TGEIDFEKAI EAPDDTTLVF NFYKAVPEQL ALFHTSLTPL PKHKWEHVAL EEFRHSPLNQ KPLGAGPYVL QEWLKQQQIV LASNPSCTLP KPGDIARIMF RIVPDYTVRL AQLQTGAVDV VENIKPEDFA GLERARAGVE IKSVGLRVYD YIGWSNIDQV SYERDGTIRP HPLFGSKNVR RALTLAIDRQ SILDGYLGEY GEVASTDISP SLKWAYNDSV TPYPYDPSEA VRILEEEGWF PGPDGIREKN GRKFSFVLYT NAGNARRNFA SVIIQQNLRE IGIDCQLDVQ ESNVFFENLR LRKIEAWMAG WSIGLEIDPL DGWGSDLEKS RFNFTGYQNS RIDTLCELAK GQMNPLDARP YWIEYQEILH RDQPTTFLYW IKETQGFNRR IEGEELNILS TFYNIDDWIL SPSAGVAE
|
| |