Gene Clim_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2136 
Symbol 
ID6355930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2354324 
End bp2355562 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content51% 
IMG OID642669727 
Productpeptidase M16 domain protein 
Protein accessionYP_001944139 
Protein GI189347610 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGTA TTCAAAGCAG CACCCTGAAA AACGGCCTCC GGGTGATAAC CGATCATGTC 
CCCTGGGTAC AGAGCGTCAC CCTCGGCATT CATATCGACG TCGGCTCCCG GGACGATCCC
GATAAAAAAA GCGGCCTTGC GCATTTTCTT GAACACGCGG TCTTCAAGGG AACAAAAAAA
CGGGACTATA TCGAGATCGC CTGCGGAATC GAACGAAACG GCGGTTATCT CGATGCCTAT
ACGACAAAGG AACAGACCTG TATCTACCTG CGGTGCCTCG ACCGGTTTAC CGAACCCGCT
CTCGATCTTC TGGCCGACCT GGTCTGCAAC CCGGTATTTC CCGAAGAGGA AATCGAAAAA
GAAAAAGAGG TGGTTCTCGA AGAGATCAGC AGCATCAACG ACACGCCGGA AGAGGTCGTT
TTCGAGGATT TTGACCGCTA CCTTTTCCGC CGTCACCCGC TCGGAACCCC TATTCTTGGA
ACGGACAAAA GCGTCAGCAA CTTTGAAAGC AGCGATCTCA CCGCTTTCAT GGCCAACTTT
TACCGGCCTG AAAACATGTT TTTAACGGCG ACCGGCAATA TCCGTCACGC GGAACTCGCA
AAACTGGCGG AACGCTGTTT TTCAACGTTA TCCCAAAACC TTACCCCTGC CCCTGAAAGA
AAACCATTTC TTCCCGGACA ATACAAAGCG TTCTCCCGGA CAGTCAAAAA AAGAGCCCAT
CAGGCCCAGA TTGTGCTTGG CTCAGCAGTT GCCCGTCACG ACAGATCTTT TTACAGCCTG
ATGGTACTGA ACACCCTCCT TGGCAGCGGA ATGAGCTCGA TACTGAACCT TGAACTGCGG
GAAAAGCTTG CCCTTGTTTA CAGTACCTAT TCATCAATTG CATTCTATGA TGATCTGACG
GTCATGAACA TCTATGCGGG AACCGACAGC AACAAAATCA CACAGACACT CGATGTCCTC
GCATCTGTCA TGAAAAGCCC TGAACTTATA GCTCCTGCCA AAGAAGAGCT GCGGTCGGCA
AAATCAAAAC TCCTCGGTTC GTTCCTGATG GGCACGGAAA AAATGACACG CCGCATGTCG
CACCTGGCCA CCGATCTCTC GTATTTCGGA AAATATATCC CGCTCGAAGA GAAAACCGCC
GCCATTGAGA ACGTTACGGT AACGGATATA ACAACAGCTG CCCGAATGCT GCTTGAAGAA
GTTCCGCTCT CGACTCTCGT GTTCAAACCC ATGCGATAA
 
Protein sequence
MEGIQSSTLK NGLRVITDHV PWVQSVTLGI HIDVGSRDDP DKKSGLAHFL EHAVFKGTKK 
RDYIEIACGI ERNGGYLDAY TTKEQTCIYL RCLDRFTEPA LDLLADLVCN PVFPEEEIEK
EKEVVLEEIS SINDTPEEVV FEDFDRYLFR RHPLGTPILG TDKSVSNFES SDLTAFMANF
YRPENMFLTA TGNIRHAELA KLAERCFSTL SQNLTPAPER KPFLPGQYKA FSRTVKKRAH
QAQIVLGSAV ARHDRSFYSL MVLNTLLGSG MSSILNLELR EKLALVYSTY SSIAFYDDLT
VMNIYAGTDS NKITQTLDVL ASVMKSPELI APAKEELRSA KSKLLGSFLM GTEKMTRRMS
HLATDLSYFG KYIPLEEKTA AIENVTVTDI TTAARMLLEE VPLSTLVFKP MR