Gene Clim_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2120 
Symbol 
ID6355098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2340375 
End bp2341778 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content53% 
IMG OID642669711 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001944123 
Protein GI189347594 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGCTG AACAATTCCT GTTCCAGGCC GAAGGCGGAT TCATCAGGCT GCCGGTATGG 
GGACATATCG CCCTGAGCAA CCCTCTGAAG CATATTCTTG CCCATCCTTC GTTTCTGCGC
CTGAAAGGCA TACGGCAGTT ATCGTTCTCC CAGCAGGTAT ATCCGGGAGC TACGCACACC
CGTTTTGAAC ATTCGATCGG CGTGTACCAT CTGATGAAAC TGATTCTGCA GCGTATGGTG
AGCAACCCGC TTGCCGTCGG ACTGCAGAAT GGGAGGTTCC GGTTTGACGA CGGGAGCTGC
CGTCTGCTTC TGGCTGCCAG CCTGCTGCAT GATATAGGCC ATTACCCGCA TGCCCATGTG
ATCGAGGAAC AGATTCCTGC CGGCAGTTGC GGCCCGGTGT TCTCGCATCA CGAAGATCTT
TGCGGAAGAT TTATCTTTCA GGAACAACCG GGATTTCCCC CGATTGCCGA AATCCTTCAC
AATGAGTGGA AGGTTGATGC CAAAGAGGTG ATCGCGCTGA TCGAAGGCAC CTCGACAAGC
GGTTTCGGCA AATTGATCAG CGGCACCCTC GACCCTGACA AGATGGATTA CCTGATGCGC
GATGCGCACC ACTGCAACAT ACCCTACGGA AGCATCGACA TCGAACGGCT CATCGAGTCG
TTTGTGCCGG ATCCCGAACG TGAGCGGTTT GCCATTACTG AAAAAGGAAT CGCGCCGCTT
GAGAGCCTGC TGTTTGCCAA ATACATGATG ATGCGCAATG TGTACTGGCA TCATACCGGC
AGGGCGCTCT CAGCCATGCT CAGACGGCTG CTGCAGGCAG TAATCGATGG AGAGCTGCTG
AGCGGACAAC AGCTGGAATC GCTCTTTTAC GACAATGCCG ACGACCGGGT ACTCTTCGAA
CTGAGAACGA TGCTGCCACA GAGGGTTTCG GGGGAAACCC TGCTGCTCGA CGATATTCTG
CAGCGCAGGG TATATAAACG GGTAGTCACC ATTCAGCCAT ATACGAAAAA CGGAATGGAC
GAACGCTGGT TCGCCTATGC CTCGGACAAT TCGTTCTGCC GGCAAAAAGA GCGGGAAATA
TGCGGTTTTC TCTCCAAACG CCATAACATG AACCTCAGCG GTCTGGAGGT ATTGATCGAC
CCTCCGTCAA AAAAGGACAT TTTCGATTAC AACGATCTCA GAGAGCTGCG GGTTTACCCT
ACCCGATCGG AACACCTGCA CTATTCGCTG CAGCTCTCTT CGGAGTACTG CCGTTTCGAT
GATTTCGACG AATCGGTCTT CCGTTCGGAT TTTATTCTAT CGTTCGAGCG TTACACTAAA
AAATTCAGAC TGCTCTGCAG GGAAAATATT ATGGAAAAAG TATCCGAGTC GATGAACGGG
GTAATGGAGA TTCTGCAGTC GTGA
 
Protein sequence
MIAEQFLFQA EGGFIRLPVW GHIALSNPLK HILAHPSFLR LKGIRQLSFS QQVYPGATHT 
RFEHSIGVYH LMKLILQRMV SNPLAVGLQN GRFRFDDGSC RLLLAASLLH DIGHYPHAHV
IEEQIPAGSC GPVFSHHEDL CGRFIFQEQP GFPPIAEILH NEWKVDAKEV IALIEGTSTS
GFGKLISGTL DPDKMDYLMR DAHHCNIPYG SIDIERLIES FVPDPERERF AITEKGIAPL
ESLLFAKYMM MRNVYWHHTG RALSAMLRRL LQAVIDGELL SGQQLESLFY DNADDRVLFE
LRTMLPQRVS GETLLLDDIL QRRVYKRVVT IQPYTKNGMD ERWFAYASDN SFCRQKEREI
CGFLSKRHNM NLSGLEVLID PPSKKDIFDY NDLRELRVYP TRSEHLHYSL QLSSEYCRFD
DFDESVFRSD FILSFERYTK KFRLLCRENI MEKVSESMNG VMEILQS