Gene Clim_1498 details

Gene Information

Locus tag: Clim_1498
Symbol:
ID: 6354814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1610209
End bp: 1612401
Gene Length: 2193 bp
Protein Length: 730 aa
Translation table: 11
GC content: 52%
IMG OID: 642669104
Product: Prolyl oligopeptidase
Protein accession: YP_001943529
Protein GI: 189347000
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
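
The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(1610209, 1612401)
print(length, protein_length(length))
```

For this record the check gives 2193 bp and 730 aa, matching the listed Gene Length and Protein Length.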


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCTTC GAAAAAGCAC TCTTGCGCTC ACCGTTCTTC TTGTTTTTCC CGGTACAACT 
CCAGCCGCAA ACCGTGTAAA CGGCTTGCCC GATGCGCCAC CGCCTGCGGC AGTAAAACCT
TTTCAGGAAA AAGTCTGTGA TACAACCATA GTCGATACCT ATCGCTATAT GGAAAAGCTC
AGCGATCCGG AGGTCACCAG GTGGATGCAG GAGCAATCGG CATACACCCG GAAAGTATTG
AACAGGATAC CCGGCAGAGA GAAGCTGTTG CATAAAATGC AGGAGTTCGA CGACAGGAAA
TCAGCAAAGA TCTATAACCT TACCATTACC GAAACCGATC GCTACTTCTA CCTCAAGCAG
ACTCCTGCGG ATGAGACCGG AAAGCTCTAT TTCCGGGATG GATTTGCCGG ATCGGAAAAC
CTGCTGTTCG ACCCGTCTGC ATACACGGGA GATAAAAAAG GCAGCTATGT GATCGGCACT
ATCGCTCCGA ATAATGACGG ATCAAAAGTC GCCTTTACCG TTTATCCGAA CGGTTCTGAA
AATGCCGCGC TGCTGATCAT GGAAACCGAA AAAGCGCAAC GGTATCCCGA AACCATAAGC
AGATGCCGTT TTGCTTCGCC CTCATGGCTT CCCGATGGAA CCTCTTTTCT CTACAACCGC
CTTGAGCCAT CCGGCAAACA GGGAAAAAAT TCTCAGTATG CGAGTAAAAC ATGGCTGCAC
CGGACCGGAA GCGACCCGTC GACGGATCGT GAGATATTTT CAAGCGCCCT GAATACCGAA
CTGGATATCA ATCCTGAAGA TATTCCCGAC GTTTCCTACG ACAAGGAGAG CGCTACCCTG
TTTGCCTTCG TATCGAATGT CGACCGGCGG CTCAAGGTTT ATTACTCACC GGCGTCAGAG
CTTGAAAAAG AACAGATTAC CTGGAAAAAG CTTTTTGAGC CCGAAGACGA AATCCATGAC
TTTGCCGTCA GGAATAACGA GCTGTACCTC TACACTCCGA AAAACGCACC GGGCTTCAGG
GTGCTGAAAA CCTCATTGCA GAACCCCGAT CTGAAAAGGG CGGAAGTGGT TATTCCAGAG
TTCAGAAATG CGAAACTCAG CGGCATGACG CTTACCAGCA GGGGTATTTT TTATAAACTG
TCGCTGAACG GCGTACAGGA AGAGCTCTAT CATCTGGAAT ATGGAAGCCT TCTGGCCAGA
AAGCTCACGC TCCCTTTCCA CGCAGGCACC ATTACGCTCT CGTCGAAAGG GTTCAGATAC
CCTGAAGTGT GGGCCGTGCT GGCCGGATGG AACCGCGATT ATCGACGCTT CCGTTACGAT
GCGGAAAGCC GCTCGTTCAT CAATGAAACC CTCTCATCAC CAGCCCAATA TCCTGAATAT
GGGGATCTTG CCGTGGAGGA ACTGATGGTC CGCTCACAGG ATGGCGTTGC TGTGCCTCTA
TCGCTCATCT ATAAAAAGGA TCTTCTGAAA AACGGATCGA ATCCCGTGCT GCTCTACAGT
TACGGAGCAT ACGGACGGTC GATGACACCG TTTTTCAGCC CATCGATGCT GCTCTGGACG
TGGAAAGGCG GAATTCTTGC GGTTCCGCAT GTGCGGGGAG GAGGCGAACT TGGCGACAAG
TGGCATACAT CAGGGATGAA AACAACAAAA GCCAACACCT GGAAAGATGC CATCAGTGCG
GCGGAATTCC TGGTTAAAAA CGGATACACC TCACCCGGCA ACATCGCCAT CAATGGTGCC
AGTGCCGGCG GAATACTGGT CGGTCGGGCC ATAACCGAAC GGCCCGATCT CTTTGCCGCC
GCCATTCCTC AGGTAGGGGC AATGAATCCG CTGCGCGCAG AAACAACCGC CAACGGCCCG
GTCAACGTGC CGGAATTCGG AACGGTTAAA ATTCCTGACG AATGCAGGGC GCTCATTGCC
ATGGATCCCT ATCTCAATCT TCGTGACGGC GTAAACTACC CTGCCGCGCT CATAACCGCC
GGAATCAACG ACCCGAGGGT GATTGCCTGG CAACCTGCAA AATTCGCCGC GCGGATGCAG
GCCGCAACCG CATCTTCGAA ACCGGTGCTC CTGTTCACCG ATTTCGAAGC CGGACACGGT
ATGGGAAACT CGAAGACAAA AAACTTCGAA GCTCTTGCAG ACGTCCTGAG TTTCGGTCTG
TGGCAGACAG GACATCCGGA ATTTCGGAAG TAA
 
Protein sequence
MILRKSTLAL TVLLVFPGTT PAANRVNGLP DAPPPAAVKP FQEKVCDTTI VDTYRYMEKL 
SDPEVTRWMQ EQSAYTRKVL NRIPGREKLL HKMQEFDDRK SAKIYNLTIT ETDRYFYLKQ
TPADETGKLY FRDGFAGSEN LLFDPSAYTG DKKGSYVIGT IAPNNDGSKV AFTVYPNGSE
NAALLIMETE KAQRYPETIS RCRFASPSWL PDGTSFLYNR LEPSGKQGKN SQYASKTWLH
RTGSDPSTDR EIFSSALNTE LDINPEDIPD VSYDKESATL FAFVSNVDRR LKVYYSPASE
LEKEQITWKK LFEPEDEIHD FAVRNNELYL YTPKNAPGFR VLKTSLQNPD LKRAEVVIPE
FRNAKLSGMT LTSRGIFYKL SLNGVQEELY HLEYGSLLAR KLTLPFHAGT ITLSSKGFRY
PEVWAVLAGW NRDYRRFRYD AESRSFINET LSSPAQYPEY GDLAVEELMV RSQDGVAVPL
SLIYKKDLLK NGSNPVLLYS YGAYGRSMTP FFSPSMLLWT WKGGILAVPH VRGGGELGDK
WHTSGMKTTK ANTWKDAISA AEFLVKNGYT SPGNIAINGA SAGGILVGRA ITERPDLFAA
AIPQVGAMNP LRAETTANGP VNVPEFGTVK IPDECRALIA MDPYLNLRDG VNYPAALITA
GINDPRVIAW QPAKFAARMQ AATASSKPVL LFTDFEAGHG MGNSKTKNFE ALADVLSFGL
WQTGHPEFRK
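
Both sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a reference implementation: it joins the blocks, computes a GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). All function names are hypothetical, and the example input is just the first three blocks of the gene sequence.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order;
# translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown in this record.
first_blocks = "ATGATTCTTC GAAAAAGCAC TCTTGCGCTC"
seq = join_blocks(first_blocks)
print(translate(seq))  # MILRKSTLAL
```

The printed peptide matches the first block of the protein sequence above. Applying `gc_fraction` to the full joined gene sequence would give a value to compare against the listed GC content.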