Gene Cphamn1_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0585 
Symbol 
ID6374249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp613282 
End bp615168 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content49% 
IMG OID642683098 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001959025 
Protein GI189499555 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.952367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0792841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGTA AAGGAGAGCC AGACAAACAG CAGGCCACGC AGCCTTGCTT TACCGGCTGT 
TTGTCTGAAT CCTCAACTGC TCCGGCGTGT CCTGAAAAGT CCGGACTGTC TGGTGTTTTT
TTACATGATC ATATCGGCAT CAACAGCGTG CTGATTGTTT GTATGCTGAT AGTCGTGACG
CTTTCAGGCT GTTCATCTGA CCGCTTTTCA TCCGGCGGAT CAGCAGGTGA TTCAACGCTG
GTGATCGCCA TGCTTGGTGA CGCCAACTAT CTCAATCCTG TGATCGGCGC TTCGGTAACA
TCCAGCAATG TCTACGGTCT TCTCTATCCG GGTCTTCTTG AAAGCGAGTT CGACACGACG
TCAGGCCTCT TGAATTTTGT CGCTCTTGAA AAAAAGTTGA GGGAATCGAC AGGGGAATCC
ATCAGGAAAA AGCCGGGCGG AGCGTTGGCA AAAACATGGA AAATGGGAGA CGATTACCGT
TCCATCACCT ATATACTCAG AAACGACGCG AAATGGAATG ACGGCACACC AATTACAGCG
CATGATTTTA AATTCACCTA TGAGCTCTAT GGCAATCCTC TTATCGCTAG CCCCCGGCAA
CAGTATCTTG CTGAGCTTGT GGGAGCTGAT ACAGGTGAAA TTGATTTCGA GAAGGCAATT
GAAGCACCTG ATGACACAAC ACTCGTGTTC AACTTCTATA AAGCGGTTCC TGAACAGCTT
GCGCTGTTTC ATACCTCTCT GACGCCACTT CCGAAACACA AATGGGAACA TGTGGCACTC
GAAGAGTTCC GGCATTCTCC CCTCAATCAG AAACCGCTGG GCGCAGGTCC TTATGTTCTT
CAGGAGTGGC TGAAACAGCA GCAGATTGTT CTTGCTTCAA ATCCTTCGTG TACGCTGCCT
AAACCTGGTG ATATCGCGCG TATCATGTTC AGGATAGTGC CTGACTACAC GGTACGTCTG
GCGCAGCTAC AGACTGGAGC TGTCGATGTT GTGGAAAACA TAAAACCGGA AGACTTTGCC
GGCCTTGAGC GCGCCAGAGC CGGTGTGGAG ATCAAGTCAG TCGGACTGCG TGTCTATGAC
TATATCGGAT GGTCGAATAT CGACCAGGTG TCTTATGAGC GTGACGGTAC TATCAGGCCG
CATCCTCTGT TCGGTTCAAA GAATGTTCGC CGGGCACTGA CGCTTGCTAT TGACCGGCAA
TCAATTCTCG ACGGTTATCT CGGAGAGTAC GGTGAGGTAG CCAGCACCGA TATATCTCCT
TCGCTCAAAT GGGCATACAA TGATTCTGTT ACACCCTACC CCTATGATCC GTCAGAGGCG
GTCAGGATTC TTGAAGAAGA GGGATGGTTT CCAGGCCCGG ATGGTATTCG AGAGAAAAAT
GGCAGAAAGT TCAGCTTTGT GCTGTATACC AATGCAGGCA ACGCCCGCAG GAATTTCGCC
AGTGTTATTA TTCAACAGAA TCTCAGGGAG ATTGGAATTG ACTGTCAACT GGATGTTCAG
GAATCAAATG TGTTTTTTGA AAACCTGCGG CTTCGTAAAA TTGAAGCATG GATGGCAGGA
TGGTCGATAG GGCTGGAAAT AGATCCTCTT GACGGATGGG GTTCAGATCT TGAAAAAAGC
CGTTTTAATT TTACCGGTTA TCAGAATTCG AGAATCGATA CCCTTTGCGA ACTGGCAAAA
GGTCAAATGA ATCCACTGGA TGCAAGACCG TACTGGATTG AGTATCAGGA AATTCTGCAC
CGCGATCAGC CGACGACATT TTTATACTGG ATTAAGGAAA CGCAGGGATT TAACCGCAGG
ATCGAAGGTG AGGAGCTGAA TATCCTCAGT ACCTTCTACA ACATTGACGA CTGGATCCTT
TCCCCGTCAG CTGGTGTTGC GGAGTAG
 
Protein sequence
MNSKGEPDKQ QATQPCFTGC LSESSTAPAC PEKSGLSGVF LHDHIGINSV LIVCMLIVVT 
LSGCSSDRFS SGGSAGDSTL VIAMLGDANY LNPVIGASVT SSNVYGLLYP GLLESEFDTT
SGLLNFVALE KKLRESTGES IRKKPGGALA KTWKMGDDYR SITYILRNDA KWNDGTPITA
HDFKFTYELY GNPLIASPRQ QYLAELVGAD TGEIDFEKAI EAPDDTTLVF NFYKAVPEQL
ALFHTSLTPL PKHKWEHVAL EEFRHSPLNQ KPLGAGPYVL QEWLKQQQIV LASNPSCTLP
KPGDIARIMF RIVPDYTVRL AQLQTGAVDV VENIKPEDFA GLERARAGVE IKSVGLRVYD
YIGWSNIDQV SYERDGTIRP HPLFGSKNVR RALTLAIDRQ SILDGYLGEY GEVASTDISP
SLKWAYNDSV TPYPYDPSEA VRILEEEGWF PGPDGIREKN GRKFSFVLYT NAGNARRNFA
SVIIQQNLRE IGIDCQLDVQ ESNVFFENLR LRKIEAWMAG WSIGLEIDPL DGWGSDLEKS
RFNFTGYQNS RIDTLCELAK GQMNPLDARP YWIEYQEILH RDQPTTFLYW IKETQGFNRR
IEGEELNILS TFYNIDDWIL SPSAGVAE