Gene Clim_0900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0900 
Symbol 
ID6354137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp986039 
End bp987724 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content51% 
IMG OID642668527 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001942958 
Protein GI189346429 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAA CATTTTTTTC AGCGCTCGTC CTGCTGCTGC TCATAACCGG CCTTACCGCA 
TGCCGCAAGG GAAACGAAAA CCTGAGAAGC GATCATATCG TTACCGGTAT ATCAGCGGAT
TTCGATTATC TCAACCCGCT TCTGATCCAG CTTTCACTCT CCCGTGAGGT CTGTACCCTT
ATCTACCCTT CACTTGTCAA GCCTGCATAC AACGAAGAAA AAGGAAAAAT AGAATATAAA
CCCTCGGCAG CAAAAAGCTG GGAGTTTTCA CCTGACGGAA AACATGCGAC CTTTCATCTT
CGTTCCGATG CTTCCTGGGA AGATGGAAAA AAAGTAACAG CACATGACTT CAAGTTTTCG
TACGCACTCT ATAAAAACCC GAAAATCGCC AGTTCACGCC AGCACTATCT CGATGATCTG
CTGCTCCTTC CCGACGGATC GGCAGATATA GAACGCGCCG TTGAAACCCC CGATGATTCA
ACCCTTGTTC TCCATTTCAA TAAATCGCTT GCCGATGAGA TCGTTATCGA CCATTTCAAT
GATCTCATGC CGGTTGCCCG TCATATCTTC TCTTCGATTC CTGCCGGTGA AATCCGCAGC
AGAGCCGCTG AGCTGCCGGT AATAGGTGCA GGGCCGTTCA AAGTGAAGGA GTGGAAGCGT
CAGGAAAAAC TGGTGCTTGC CTCAAATCCG TCGTCGGTAC TCCCCCACCC TGCCGTAACC
GGAACCATGA CCTTTCTGGT CGTGCCGGAA TACACCACCA GGCTTGCCAT GCTGAGATCG
GGACAGATCG ATGCGCTGAT CTCTGCCGGA GGCATTACCC CGAAGGATGC CGCAGAACTG
AAAACGACAA ATCCTGAAAT TACGATCAAA CCGGTACGCA ACAGATATTT TGACAGCATT
GTCTGGCTTA CTATCGACGG CGATACCTAC CGGAAGTCGC GCACTGTTGC ACCGAACCTG
TTTTTCGGCG ATCCAAGGGT GCGCAGAGCC CTGACCTATG CCGTCGATCG TGAATCCATT
ATTGACGGAT TCATGGGTCC GAACCATGCC GTTATCGTGA ACACATCACT CTCTCCGGCG
TATAAAGCGT TTGTAAACAA AACGATGGAA CCGTACTCCT TCAATCCGGA AAAATCACGG
GAACTGCTCA AAACCGCAGG CTGGACACCA GGACCTGACG GCATTCTGCA GAAGAACGGC
GTTCGGTTCT CTTTTGAACT GGCTGCCCCT GTAGGAAATC CTCGCCGGAA CTATGCCGCA
ACCATCGTCC AGCAGAACCT GAGGGATATC GGCATTGACT GTCGGCTGCG ATTCGACGAG
AGCCTGATTT TTCTGAAAAA CCAGAACGAG TTCCGCTATG ATGCGGCGCT CTCGGGCCTC
GCCGCAGAAA CCCTCCCGTT TCAGCTGATC ATCTGGGGCA GCGATTTCGC AAACCGCCCG
TTCAACTCGT CGGCATTCAG GAATCCCGAA CTCGATCGGG TTATCGAGGC GCTCGGCAAA
CCGGTAACCG GAGAAAAGAA AACCGCACTC TGGAAAACCT ATCAGCAGAT TCTCAATGAC
GAACAACCAA GAACGTTTCT CTATTATTAT GATGAACTTG AAGGATTCGG CAAGCGATTG
AGTAATGTCG AGGTCAACCT GCTTTCGACA CTCTACAACG CCTGCGACTG GAAACTGAAT
CAATAA
 
Protein sequence
MRKTFFSALV LLLLITGLTA CRKGNENLRS DHIVTGISAD FDYLNPLLIQ LSLSREVCTL 
IYPSLVKPAY NEEKGKIEYK PSAAKSWEFS PDGKHATFHL RSDASWEDGK KVTAHDFKFS
YALYKNPKIA SSRQHYLDDL LLLPDGSADI ERAVETPDDS TLVLHFNKSL ADEIVIDHFN
DLMPVARHIF SSIPAGEIRS RAAELPVIGA GPFKVKEWKR QEKLVLASNP SSVLPHPAVT
GTMTFLVVPE YTTRLAMLRS GQIDALISAG GITPKDAAEL KTTNPEITIK PVRNRYFDSI
VWLTIDGDTY RKSRTVAPNL FFGDPRVRRA LTYAVDRESI IDGFMGPNHA VIVNTSLSPA
YKAFVNKTME PYSFNPEKSR ELLKTAGWTP GPDGILQKNG VRFSFELAAP VGNPRRNYAA
TIVQQNLRDI GIDCRLRFDE SLIFLKNQNE FRYDAALSGL AAETLPFQLI IWGSDFANRP
FNSSAFRNPE LDRVIEALGK PVTGEKKTAL WKTYQQILND EQPRTFLYYY DELEGFGKRL
SNVEVNLLST LYNACDWKLN Q