Gene Cphamn1_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0940 
Symbol 
ID6374607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1016722 
End bp1018401 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content49% 
IMG OID642683442 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001959366 
Protein GI189499896 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAA TAGTATTTTC GATAGCAATT CTTTTTCTTT TTTCAGCACT TTCCTCTTGC 
AGCAACAGTT CACGGGAATA TCGTTCAGAC CAGGTTGCCA TAGGCGTTGA CGCGGATTTC
GATCACCTCA ACCCTCTTCT CATCCAGCTC TCCCTCTCCA GAGAAGTCTG CAATCTGATT
TTCCCTTCAC TGGTCAAACC AGATTATGAC CCTGAGCTTG GCACCATTAC CTTTCAACCG
AATACCGCTG AGAAATGGGA GTTCACCGAA GGCGGCAGAA AAGCCGTCTT CCATCTCCGG
AAAGACGCTG TATGGCAAGA CGGTGTGCCG GTCACGTCAC ATGATTTCAA ATTTTCCTAC
CGGCTCTATG CCGACCCGAA TATCGCCAGT TCGCGCCAGC ATTACCTCAA CGATCTGCTC
CTCCTCGATG ACGGCAGTAT TGACTTTGAC AACGGCATTG AAACTCCTGA TGACACAACG
CTTGTTCTCA CTTTCATGAA ACCGATGGCC CCGGCAATCA TTCTCGATCA TTTTAATGAC
CTGATGCCTG TCGCAAAACA TATTTTCAAG TCGATCCCTC CTGAAGAAAT CCGGATGAAA
GCTGCAGAAA CACCGATCAT CGGAGCAGGC CCCTTCAAGG TAAAAGAGTG GGCGCGTCAG
CATAAGCTGT TGCTTGAATC AAATAAAACC TCTGTGCTCC CCCAGCCTGC TGCTGTCTCG
AAAATGAGCT TTATCATCAT TCCGGAATAC ACAACCCGAC TGACGATGCT TAAGTCAGGT
CAGCTCGACG CAGTTATTTC AGCAGGCGGC ATCAACCCCA AAGATCTTGA AGAGCTGAAA
AGAAGCAATC CGGAGATCTC GATCAAACCC GTACGAAACC GCTACTTTGA CAGTATTGTC
TGGCTCAATA TCAACGGCGA GCAGTACAGG GAAAACAAAA TAATAGAACC GAATGTCTTT
TTCGGAGACA AAAGGGTACG AAAGGCCATG ACCTATGCCA TAGACCGCCA GTCGATCATC
GACGGGTTTA TGGGCCCTGA ACATGCCACC ATTGTCAACA CGTCCCTCTC TCCTGCATAT
GAAGCTATCG CAAACACATC GCTTGGAACT TACGCATTCG ACCCGCAAAA GGCCGAATCG
CTGCTCAGGC AATCAGGCTG GGAGCCGGGA CCGGACGGCA TCCTGCAGAA AAACGGCACA
CGCTTTTCAT TCACGCTTGC TGCCCCTGCA GGTAACCCCC GAAGGAATTA TGCGGCAACA
ATTATCCAGC AGAACCTCCG CGAGATCGGT ATAGAATGTA AACTGAGAAT AGATGAAAAA
CTCATTTTTC TGAAAAACCA GAACGAGTTC CGGTACGATG CAGCCCTGTC GGGATTAGCC
GCAGAAACAC TTCCGTTTCA GCTTATCATC TGGGGGTCGG ACTTCGAAAA CCGCACGTTC
AACTCTTCGG CTTTTCAGAA TCAGGCCCTG GACCGCGTCA TCAGCCGCCT TAACACCCCC
CTGCCTGAAA ACGAAAGCCT CATCTTGTGG AAAGAGTACC AGAAAATCCT GCATGAAGAA
CAGCCGAGAA CCTTCCTCTA CTACTATGAC GAACTTGAAG GGTTCAGCAA CCGGGTAAAA
AATGTAGAAG TAAACCTTCT TTCCACCCTT TATAACGCGT ATGCGTGGGA ACTGGAATAG
 
Protein sequence
MQKIVFSIAI LFLFSALSSC SNSSREYRSD QVAIGVDADF DHLNPLLIQL SLSREVCNLI 
FPSLVKPDYD PELGTITFQP NTAEKWEFTE GGRKAVFHLR KDAVWQDGVP VTSHDFKFSY
RLYADPNIAS SRQHYLNDLL LLDDGSIDFD NGIETPDDTT LVLTFMKPMA PAIILDHFND
LMPVAKHIFK SIPPEEIRMK AAETPIIGAG PFKVKEWARQ HKLLLESNKT SVLPQPAAVS
KMSFIIIPEY TTRLTMLKSG QLDAVISAGG INPKDLEELK RSNPEISIKP VRNRYFDSIV
WLNINGEQYR ENKIIEPNVF FGDKRVRKAM TYAIDRQSII DGFMGPEHAT IVNTSLSPAY
EAIANTSLGT YAFDPQKAES LLRQSGWEPG PDGILQKNGT RFSFTLAAPA GNPRRNYAAT
IIQQNLREIG IECKLRIDEK LIFLKNQNEF RYDAALSGLA AETLPFQLII WGSDFENRTF
NSSAFQNQAL DRVISRLNTP LPENESLILW KEYQKILHEE QPRTFLYYYD ELEGFSNRVK
NVEVNLLSTL YNAYAWELE