Gene Clim_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2069 
Symbol 
ID6355047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2280937 
End bp2282619 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content54% 
IMG OID642669665 
ProductPeptidoglycan-binding domain 1 protein 
Protein accessionYP_001944077 
Protein GI189347548 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.118953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGCTT TCGTTTTTCT TCTGCTTTTG CTTGTTTCGG CTCGAGTCGT TCAGGGCGAG 
CCGGTTTCTC TTCAGAATCA GGTTATCGGA AAGAAAACGA CGGCTCGGCA GCATGCGCCC
GATTCCGTTC TCGCTGCACA GTTGCGTTGC CATTTTCAGG CGATGGACAG CAACGCGGCA
GGTCCGGAAC GGAGGGCATT CAACAATCAG CTGGCCCGCT TTTATGCCGC ACGGAACTAT
AGGCCGGTCT GGACGGAACG GGCAGACATT GCCGAACTCA TCGAAGCCAT CGGTGAAAGT
GAAAACGATG GGTTGATTCC CGATGATTAT CACATCAAAG AGATCCGCAC TTTTTTCCTC
TCTCCTCCGC GTACTCCTGA ACTGCAGGCG AAGTACGATC TGCTGCTCAG CGATGCATTG
CTGAGTCTTG CATATCATCT TCGTTTCGGG AAAGTAGATC CGGAAAGCCT TGACCCCAAC
TGGAATCTTG ACGGCACTGC GCGTCGGACG GCACTTGAAT ACCGGTTGCA GAATGCTCTT
GCCGCGGGCC GCCCCAAAGC GGCGCTCGAT GAACTTCGAC CGAAGCATTC CGGATACGCC
GAACTGAAAA AAGGTCTGGC CCGCTACCGG GTTATCGCAC GGGCAGGTGG TTGGCAGAAG
GTTCCCGAGG GGGATTCTTT CAGGGAAGGA GTCAGAGACA GCCGGGTTCC TCTCCTTCGA
AAACGGCTTC AGCAGTCCGG AGACCTTCCG GGCGGGGTTA CCGACAGCTC GAAGGTATAC
ACTGCTGCCA TGGCAAATGC CGTGAAACGG TTTCAGAAAC GCAACGGCCT GTCGGTTGAC
GGCGTAGCCG GAACGGCGAC AATCGGTGAA ATCAATATTT CAGCAGCTGA GCGTGTCGAT
CAGATACGCC TTAATCTGGA GCGTTACCGC TGGTTCGTCA ACGATCTCGA GCCAACCTAC
GTGCTGGTGA ACATTGCCGG CTTCACTCTG CAGTATATAG AGAACGGGCG CTATCGCTGG
GGAACGCGGG TGATTGTGGG ACAACCCTAT CGAGAGACCC CGGTTTTCAA GGCAGATATG
CAGTATATCG TCTTCAATCC GCAATGGGTT ATTCCGCCGA CCATTCTTGC CGAGGACGCT
CTCCCGGCCA TTCGTAACAG CCGCTCCTAT CTTGACAGAA AGAAACTCAG GGTAATCGAT
TCCAGGGGCA GGGTGGTCGA TCCGGCTTCA GTCAACTGGT CGGGCTATTC GGCAGCCAAC
TTTCCCTATC GGCTTCAGCA AACAGCCGGT GACCATGGAG CCCTTGGCAG AATCAAGTTC
ATGATGCCCA ACAAACACGT TATCTATCTT CACGATACGC CGACCAAAAA CCTGTTTGAA
AAAAGCGAGC GCACCTTCAG TTCCGGTTGT ATAAGGGTTG AAAATCCGCT CGATCTTGCG
CAGCTTGTGC TGCAGGATTC GGTAAAATGG AACAAAACCA GTATCGACAG CACTATCGGT
ACGGGAAAAA CAAGCACGGT CAATCTTCCG AAAAGGATAC CGGTTTTTCT TCTCTATCTG
ACGGCAATCG CCGAAGGTGA GGAGATACAG TTCCGCCGGG ATGTCTATAA CCGAGACGAT
CGCCTTCGGA AGGCGCTCGA TTCACCGGTA CCGCAATACC GGATCGAAAG CTGCGGACTC
TGA
 
Protein sequence
MRAFVFLLLL LVSARVVQGE PVSLQNQVIG KKTTARQHAP DSVLAAQLRC HFQAMDSNAA 
GPERRAFNNQ LARFYAARNY RPVWTERADI AELIEAIGES ENDGLIPDDY HIKEIRTFFL
SPPRTPELQA KYDLLLSDAL LSLAYHLRFG KVDPESLDPN WNLDGTARRT ALEYRLQNAL
AAGRPKAALD ELRPKHSGYA ELKKGLARYR VIARAGGWQK VPEGDSFREG VRDSRVPLLR
KRLQQSGDLP GGVTDSSKVY TAAMANAVKR FQKRNGLSVD GVAGTATIGE INISAAERVD
QIRLNLERYR WFVNDLEPTY VLVNIAGFTL QYIENGRYRW GTRVIVGQPY RETPVFKADM
QYIVFNPQWV IPPTILAEDA LPAIRNSRSY LDRKKLRVID SRGRVVDPAS VNWSGYSAAN
FPYRLQQTAG DHGALGRIKF MMPNKHVIYL HDTPTKNLFE KSERTFSSGC IRVENPLDLA
QLVLQDSVKW NKTSIDSTIG TGKTSTVNLP KRIPVFLLYL TAIAEGEEIQ FRRDVYNRDD
RLRKALDSPV PQYRIESCGL