Gene Clim_1673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1673 
Symbol 
ID6353980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1838877 
End bp1840394 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content49% 
IMG OID642669278 
Productprotease Do 
Protein accessionYP_001943694 
Protein GI189347165 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0684799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA GAAAAAAAAA ATTGAAATAC CTGCTGCTGC TCCTTGTGGG AATAGCTGTG 
GGCAGCGTTG TTTTTGCCAA TGTCGAGTTT AGTTTTCCCG GTCGGAATGC GGGTTTGAAT
CTCAGTAACA AACCAAATTA CGCCAATGCG AAAAATAACT TTGAGAATTA CCCGATACAG
ACACTTCGAG ATTTCAACGA AGCGTTCGTA AAAATTGCAG AATCGGCAAC ACCTTCGGTG
GTGACCATTT TTACCGAAAA GACCGTGAAT AGAAGGGTTA TCAGTCCGTT CGAGTTTTTC
GGACGCAGCC CTTTTGATGA TTTTTTTGGT TCGCCGAAGG GGGGGAGCCA GAATGGACGC
AAGGAGGTGC AGCGCGGTCT CGGTTCGGGT GTGATTGTGA CGTCGGACGG ATATATCCTT
ACCAACAATC ATGTCGTAGA TAAGGCAGAT GCCATCTATA TCCGCACATC CGACAACAGA
AAAATAGAGG CTAAAATCAT CGGTACGGAT CCTAAAACCG ATCTTGCCGT GATCAAGGTG
AACGTAAGAG GGCTTAAACC CATCATGATA GGGGACAGCG ACCGGCTTCG TGTTGGCGAG
TGGGTTATCG CTATCGGCAG TCCTCTCGGA GAGAGCCTCG CCAGAACCGT TACCCAGGGC
ATTGTCAGCG CTATCGGACG TTCCAATGTC GGCCTTGCCG ATTACGAGGA TTTTATCCAG
ACCGATGCAG CGATCAATCC GGGCAACTCG GGAGGCCCGC TGGTCAATAT CAACGGTGAA
TTGGTAGGCA TCAATACCGC AATTGCGAGC CGTACCGGGG GGTTTGAAGG TATTGGTTTT
GCGGTTCCTT CAAATATGGC ACAGAAAGTA CTGAATTCAC TCATTACGAC GGGAAAGGTT
ACCCGGGGCT ATCTTGGTGT GAGTATTCAG GACATTGACG AGAATATTGC AAAAGGTCTC
CAGCTTCAGA GCGCATCCGG GGTGCTTGTC GGTACCGTTG TGGCCGGCAG TCCAGCTGCA
AAATCAGGAA TCAGAACAGG TGATGTCATT CTTGATTTCA ACGACAGGAA ATCTACATCG
AGTGTTGATC TTCGCAATGA GATTGCCGTG ATGTCTCCGG GGTCCGTGGT TAAAATAAGG
ATTCTCAGAG ACGGAGAGGT AAAAGTTTTC AGTGTCCGTC TTGAAGAACA GCCGGATCAG
GCTTCTGTTT CTGCTGCACC TGCACAGCAG AACCCCGAAG TGCTTGGTTT CAGGTCACAG
GAGCTGACGC CTCAGCTCGC TCAGCAATTT CAGTTGCAGC AGGGTGCCGG AAAAGTGCTT
GTCACCGATG TCGACCAGTC AAGCAATGCA TTTCGTGCCG GTCTTCGTGC CGGTGACGTT
ATTGTAGCGG TCAACAAGCA GGCGATAAGT TCATATGCAC AGTACGCAGC GCTTCTGCAA
AAAGTGAAAG GCGGCGATCT TGTGTTTCTG CTTATCGACC GGCGAGGCAG TAAGGTCTAT
TTTGCCTTTA ATGTATAA
 
Protein sequence
MNKRKKKLKY LLLLLVGIAV GSVVFANVEF SFPGRNAGLN LSNKPNYANA KNNFENYPIQ 
TLRDFNEAFV KIAESATPSV VTIFTEKTVN RRVISPFEFF GRSPFDDFFG SPKGGSQNGR
KEVQRGLGSG VIVTSDGYIL TNNHVVDKAD AIYIRTSDNR KIEAKIIGTD PKTDLAVIKV
NVRGLKPIMI GDSDRLRVGE WVIAIGSPLG ESLARTVTQG IVSAIGRSNV GLADYEDFIQ
TDAAINPGNS GGPLVNINGE LVGINTAIAS RTGGFEGIGF AVPSNMAQKV LNSLITTGKV
TRGYLGVSIQ DIDENIAKGL QLQSASGVLV GTVVAGSPAA KSGIRTGDVI LDFNDRKSTS
SVDLRNEIAV MSPGSVVKIR ILRDGEVKVF SVRLEEQPDQ ASVSAAPAQQ NPEVLGFRSQ
ELTPQLAQQF QLQQGAGKVL VTDVDQSSNA FRAGLRAGDV IVAVNKQAIS SYAQYAALLQ
KVKGGDLVFL LIDRRGSKVY FAFNV