Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0940 |
Symbol | |
ID | 6374607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1016722 |
End bp | 1018401 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642683442 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001959366 |
Protein GI | 189499896 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAA TAGTATTTTC GATAGCAATT CTTTTTCTTT TTTCAGCACT TTCCTCTTGC AGCAACAGTT CACGGGAATA TCGTTCAGAC CAGGTTGCCA TAGGCGTTGA CGCGGATTTC GATCACCTCA ACCCTCTTCT CATCCAGCTC TCCCTCTCCA GAGAAGTCTG CAATCTGATT TTCCCTTCAC TGGTCAAACC AGATTATGAC CCTGAGCTTG GCACCATTAC CTTTCAACCG AATACCGCTG AGAAATGGGA GTTCACCGAA GGCGGCAGAA AAGCCGTCTT CCATCTCCGG AAAGACGCTG TATGGCAAGA CGGTGTGCCG GTCACGTCAC ATGATTTCAA ATTTTCCTAC CGGCTCTATG CCGACCCGAA TATCGCCAGT TCGCGCCAGC ATTACCTCAA CGATCTGCTC CTCCTCGATG ACGGCAGTAT TGACTTTGAC AACGGCATTG AAACTCCTGA TGACACAACG CTTGTTCTCA CTTTCATGAA ACCGATGGCC CCGGCAATCA TTCTCGATCA TTTTAATGAC CTGATGCCTG TCGCAAAACA TATTTTCAAG TCGATCCCTC CTGAAGAAAT CCGGATGAAA GCTGCAGAAA CACCGATCAT CGGAGCAGGC CCCTTCAAGG TAAAAGAGTG GGCGCGTCAG CATAAGCTGT TGCTTGAATC AAATAAAACC TCTGTGCTCC CCCAGCCTGC TGCTGTCTCG AAAATGAGCT TTATCATCAT TCCGGAATAC ACAACCCGAC TGACGATGCT TAAGTCAGGT CAGCTCGACG CAGTTATTTC AGCAGGCGGC ATCAACCCCA AAGATCTTGA AGAGCTGAAA AGAAGCAATC CGGAGATCTC GATCAAACCC GTACGAAACC GCTACTTTGA CAGTATTGTC TGGCTCAATA TCAACGGCGA GCAGTACAGG GAAAACAAAA TAATAGAACC GAATGTCTTT TTCGGAGACA AAAGGGTACG AAAGGCCATG ACCTATGCCA TAGACCGCCA GTCGATCATC GACGGGTTTA TGGGCCCTGA ACATGCCACC ATTGTCAACA CGTCCCTCTC TCCTGCATAT GAAGCTATCG CAAACACATC GCTTGGAACT TACGCATTCG ACCCGCAAAA GGCCGAATCG CTGCTCAGGC AATCAGGCTG GGAGCCGGGA CCGGACGGCA TCCTGCAGAA AAACGGCACA CGCTTTTCAT TCACGCTTGC TGCCCCTGCA GGTAACCCCC GAAGGAATTA TGCGGCAACA ATTATCCAGC AGAACCTCCG CGAGATCGGT ATAGAATGTA AACTGAGAAT AGATGAAAAA CTCATTTTTC TGAAAAACCA GAACGAGTTC CGGTACGATG CAGCCCTGTC GGGATTAGCC GCAGAAACAC TTCCGTTTCA GCTTATCATC TGGGGGTCGG ACTTCGAAAA CCGCACGTTC AACTCTTCGG CTTTTCAGAA TCAGGCCCTG GACCGCGTCA TCAGCCGCCT TAACACCCCC CTGCCTGAAA ACGAAAGCCT CATCTTGTGG AAAGAGTACC AGAAAATCCT GCATGAAGAA CAGCCGAGAA CCTTCCTCTA CTACTATGAC GAACTTGAAG GGTTCAGCAA CCGGGTAAAA AATGTAGAAG TAAACCTTCT TTCCACCCTT TATAACGCGT ATGCGTGGGA ACTGGAATAG
|
Protein sequence | MQKIVFSIAI LFLFSALSSC SNSSREYRSD QVAIGVDADF DHLNPLLIQL SLSREVCNLI FPSLVKPDYD PELGTITFQP NTAEKWEFTE GGRKAVFHLR KDAVWQDGVP VTSHDFKFSY RLYADPNIAS SRQHYLNDLL LLDDGSIDFD NGIETPDDTT LVLTFMKPMA PAIILDHFND LMPVAKHIFK SIPPEEIRMK AAETPIIGAG PFKVKEWARQ HKLLLESNKT SVLPQPAAVS KMSFIIIPEY TTRLTMLKSG QLDAVISAGG INPKDLEELK RSNPEISIKP VRNRYFDSIV WLNINGEQYR ENKIIEPNVF FGDKRVRKAM TYAIDRQSII DGFMGPEHAT IVNTSLSPAY EAIANTSLGT YAFDPQKAES LLRQSGWEPG PDGILQKNGT RFSFTLAAPA GNPRRNYAAT IIQQNLREIG IECKLRIDEK LIFLKNQNEF RYDAALSGLA AETLPFQLII WGSDFENRTF NSSAFQNQAL DRVISRLNTP LPENESLILW KEYQKILHEE QPRTFLYYYD ELEGFSNRVK NVEVNLLSTL YNAYAWELE
|
| |