Gene Clim_0614 details


Gene Information

Locus tag: Clim_0614
Symbol:
ID: 6354062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 688932
End bp: 690713
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 52%
IMG OID: 642668245
Product: extracellular solute-binding protein family 5
Protein accession: YP_001942680
Protein GI: 189346151
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
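
The coordinate and length fields above are consistent with each other, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 688932
END_BP = 690713


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # prints "1782 bp; 593 aa"
```

Under these assumptions the result matches the reported Gene Length (1782 bp) and Protein Length (593 aa).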


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.459899
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAAG CTGACTGCAT TACGAGGCCG TACCGTGGCG GCGCTCTCTA TTGCTGTTTT 
CTGGTACTTT CGCTGCTGCT GAGCTCATGC ACCGGCGGTT CCAACAACAG GGAGCGAGGC
AGTGCGGCAA AAGATACGAC ACTGGTTATC ACCATGCTTG GCGATGCCGA TTTTCTCAAT
CCGGTTATCG GGGCCAGTGT CACCTCGAGC AATATATCGG GGCTGATCTA TCCTTCCCTG
CTGCAGAGCG AATTCGACAC GACAACCGGT CTTCTGAATT TTCTGGCCCT TGAAAAACAG
CTCCGTCCTT CAATCGGAGG GAAAAAACCG GAAGCTGCGC TGGCAAAAAC CTGGAAGATG
TCCGCTGATC ACAAGTCCAT AACGTATATC CTGCGTGACG ACGCCTTCTG GGATGATGGG
AAACCGATTG TTTCGGGGGA TTTCAGTTTT ACCTACCGTC TCTACGGCAA TCCTCTCATT
GCAAGCCCCC GCCAGCAGTA CCTTGCGGAG TTGATTGGCG CCGATAAAGG CAGTGTCGAT
TTCGACAGGG CAATCGAAAC CCCTGACGAT ACAACGCTGA TTTTCAGATT TTTTAAACCG
GTTCCGGAAC ATCTGGCACT TTTTCACACG TCGCTTACCC CGCTTCCCGC ACATCTATGG
AAAGGTATAA AGCCTGAGGA TTTAAGGAGC TCGCCGCTCA ATCAGAAGCC TGTCGGGGCC
GGGCCGTACA GGCTGCAGGC ATGGGGCAAG CAGCAGGAGC TTGTGCTTGC TTCGAACAGG
AGATCAAATC TGCCGAAACC CGGCAACATC GCGCTGATCA ACTGGCGGAT TGTGCCGGAT
TATACGGTAA GACTTGCTCA ACTGCAGACC AATGCCGTTG ATATCGTGGA AAATATCAAG
CCTGAGGATT TTCCTGCACT GGTAAAGGCC AATCCCGAAG TTGAAATAAA GAGCGTCGGG
CTCAGGGTCT ATGATTATGT CGGTTGGTCG AATATTGATC AGGCCGTCTA TCATAAAACC
GGGAAAACGG TGCCGCATCC GCTTTTCGGT TCTCCTGAAG TGAGACGGGC ACTTACCATG
GCGGTTGATC GCGAAGCGAT TATCGACGGG TATCTCAAAG AGTATGGTAC GCTTTGCAAT
ACCGATATTT CGCCATCGCT TAAATGGGCT TATAACCGCT CGATCACTCC TCACTCTTAT
GATCCTGCGG CGGCCGTATC GCTGCTCGGA AAACAGGGCT GGAAACCCGG ACCGGACGGC
ATTCTTCAGA AAAACGGCCG AAAGTTCAGC TTTGTGCTCT ACACCAATTC CGGCAACGCA
CGCCGAAACT ATGCAAGCGT GATCATTCAG CAGAACCTCA AGGCTATAGG CATCGATTGC
AGGCTCGATG TGCAGGAATC GAACGTCTTT TTCGAAAACC TGCAGAATCG CAAACTCGAC
GCATGGATGG CCGGCTGGTC TATCGGTCTG GAGATAGATC CTCTCGATGT CTGGGGTTCG
GATCTCAAGA AGAGTACCTT CAATTTCGTC GGTTACCGGA ATCCGAGAAT CGATGAAATC
TGTGAGCTGG CTAAAGGGAA GATGGTTCAG CCCGATGCGC GTCCCTACTG GCTGGAGTAT
CAGGATATCA TCCATCGCGA CCAGCCGGTT ACTTTTCTCT ACTGGATCCG TGAGACTCAG
GGATTCAGTA AAAGAATCGG CGGCGAACAG CTTAATATAT CCGGTACTTT TTACAATATT
GACGACTGGA CGCTCACTCC ATCGGCAACA CATGCACCGT AG
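
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming GC content is defined as the percentage of G and C bases over all bases and that the spaces and line breaks in the listing are formatting only. The function name `gc_content` is hypothetical, and the example input is just the first two 10-base blocks of the sequence, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, copied with their separating space.
fragment = "ATGATGAAAG CTGACTGCAT"
print(gc_content(fragment))  # prints 40.0
```

Passing the full 1782-base listing to the same function would give the gene-wide value to compare against the reported GC content.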
 
Protein sequence
MMKADCITRP YRGGALYCCF LVLSLLLSSC TGGSNNRERG SAAKDTTLVI TMLGDADFLN 
PVIGASVTSS NISGLIYPSL LQSEFDTTTG LLNFLALEKQ LRPSIGGKKP EAALAKTWKM
SADHKSITYI LRDDAFWDDG KPIVSGDFSF TYRLYGNPLI ASPRQQYLAE LIGADKGSVD
FDRAIETPDD TTLIFRFFKP VPEHLALFHT SLTPLPAHLW KGIKPEDLRS SPLNQKPVGA
GPYRLQAWGK QQELVLASNR RSNLPKPGNI ALINWRIVPD YTVRLAQLQT NAVDIVENIK
PEDFPALVKA NPEVEIKSVG LRVYDYVGWS NIDQAVYHKT GKTVPHPLFG SPEVRRALTM
AVDREAIIDG YLKEYGTLCN TDISPSLKWA YNRSITPHSY DPAAAVSLLG KQGWKPGPDG
ILQKNGRKFS FVLYTNSGNA RRNYASVIIQ QNLKAIGIDC RLDVQESNVF FENLQNRKLD
AWMAGWSIGL EIDPLDVWGS DLKKSTFNFV GYRNPRIDEI CELAKGKMVQ PDARPYWLEY
QDIIHRDQPV TFLYWIRETQ GFSKRIGGEQ LNISGTFYNI DDWTLTPSAT HAP
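
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below shows how the two listings relate, assuming the amino acid assignments of table 11 match the standard genetic code (the two tables differ in their permitted start codons) and that translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases and the last line of the gene sequence are used as examples.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering; the amino acid string
# is the standard assignment, assumed here to apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence: the start of the protein.
print(translate("ATGATGAAAG CTGACTGCAT TACGAGGCCG"))  # prints MMKADCITRP
# Last line of the gene sequence, which ends in the stop codon TAG.
print(translate("GACGACTGGA CGCTCACTCC ATCGGCAACA CATGCACCGT AG"))  # prints DDWTLTPSATHAP
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.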