Gene Clim_2455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2455 
Symbol 
ID6354725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2688980 
End bp2690365 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content47% 
IMG OID642670044 
Producthypothetical protein 
Protein accessionYP_001944454 
Protein GI189347925 
COG category 
COG ID 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000334533 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCTTA TTCTCGCTTG CTTTCTCGTC GTAAGTCGTG TGGTTTTAGC GAAAAGTACG 
GCAGAGGAAG TCGGCCTGGG GCCGCAGTCA ACACAGGCTG TCGGGAATAA GCGGGTTCTG
ATGGTTGTTG TACGGTTTCC TGATGCAGCG CCGACAACCC CAATCGAGGT TGTAAAGAAA
AAGGTTATTG AAGGACTCGG TTCGTATGTT GACGAACAGT CGTATGGACT TGCTTCTATA
ACAGCCGATT TCAGGGGGTA TGTTATGTTG CCCGATGCAC TCGCAGACTA TAGGGTAAGT
CCCTATAATT TCCGTGTGGA CAAAACAAGA ATTCGCAAAC TTATTGGCGA CACCATGACC
GCCATAGAGA AGGATACTGA TTTTTCGGCC TACGATCACT TCATGATAGT ACCCGCGGTA
CACACCATGC CAGGACAAGG GTATGGGATG ATCTGTTACT GCGCAAATCC TGGTATGCTT
TCAGGTGTTA CAAAGGGATA TGTTCCTCGG TATGTAACCA TGAAATCGGC AGGAGGAAAA
GAGTTTAGCG GCGGGATTTT TGTAGGGGCA GAGAATGCGA ATATCGGCAT GTTCGCACAT
GATTATTTTC ATGTTCTGGC AGGGGTTCAT GACGGGAGGC GACTCGTGCC CTGTCTCTAT
AATTATAAGC TGCAGTCCGA TGCTTCAGCA GGTCTCCCCT CATTTGAACA TCATGCTACC
TATATGGGAC TTTGGGACAT TATGTCGCAG CATTTTGTAA AAAAGGGAGA GCCTCCTCAA
GGAACATCGT CGTTTACTAA AATAAGGCTT GGCTGGATCA AGAAGCATCA GGTTCGGATT
GTAAAACCTG GCGCAACCGA TTTCACCCTG CTTGCGCCTC TCTCAAAAGG AGGTCAATTA
CTTGCGGTCA AGATACCGTT AGACGACGGG TCGTATTATC TTGTGGAGAA CAGGCAACCA
ATAGGATTTG ACAGGATGCT TCCTGATTCG GGAATAATTG TGCTGAAAGT AAATCCTGTG
GCTGATGAGG GATATGGTAC AGTAGAAGTT CTCTGTGCTG CGGGGGCAGG CAATTTTATG
GAGGCGACCT ACAGGCTGGA GGCAAGCAAA AGGGATTGTT TTGTCGACGA AAGAAATAAT
GTTACGATAC TGCCCTTATG GAAGCAGCAC GAACATGTCG GGGTGCTGAT CACAACGTCA
GAACATCGTG AAGCTGCGGG TAAAGCTGCT CGGGCTATAC AGGCTCTTAT CGATCAAACC
GCCGTGACAA AGGACAATAC AATGGAAACG GTAATTCTTG AAGCTGTTAC TGCGTTCAGG
AATAATGAAT TTGAAAAAAG CTATACTATT GCCATCAAAA AGAACGGAAA GGATATCCGC
CATTGA
 
Protein sequence
MPLILACFLV VSRVVLAKST AEEVGLGPQS TQAVGNKRVL MVVVRFPDAA PTTPIEVVKK 
KVIEGLGSYV DEQSYGLASI TADFRGYVML PDALADYRVS PYNFRVDKTR IRKLIGDTMT
AIEKDTDFSA YDHFMIVPAV HTMPGQGYGM ICYCANPGML SGVTKGYVPR YVTMKSAGGK
EFSGGIFVGA ENANIGMFAH DYFHVLAGVH DGRRLVPCLY NYKLQSDASA GLPSFEHHAT
YMGLWDIMSQ HFVKKGEPPQ GTSSFTKIRL GWIKKHQVRI VKPGATDFTL LAPLSKGGQL
LAVKIPLDDG SYYLVENRQP IGFDRMLPDS GIIVLKVNPV ADEGYGTVEV LCAAGAGNFM
EATYRLEASK RDCFVDERNN VTILPLWKQH EHVGVLITTS EHREAAGKAA RAIQALIDQT
AVTKDNTMET VILEAVTAFR NNEFEKSYTI AIKKNGKDIR H