Gene Clim_1977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1977 
Symbol 
ID6355481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2194516 
End bp2195574 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID642669575 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_001943988 
Protein GI189347459 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.574903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACATA CGTTCAGAAC CGTTTTTGCC GTGCTGCTGC TGGCGCTGGA ACTTGCCGGG 
TGCACCGCCT CCTTTCAGGC AGTGCAGCAG CGCCCGGTTC TTTCGCTGTT TTACGCGACC
GACCGGGCCA TGTCCGGAAG CAGCGAACCG GGGGAGTTCT ACAATTCGGA TCACGCTCCG
CTGCAGTATG GAACCTGCAC AGTCTCGGTG CCGCAAAAGC ACCGCATAGC CGAGCTTGAA
AGGCCGGTGC TGAGCATGCA TCCGGAACGT CATTTTGAAC TTCTTTCGAT CGATACCCTC
GACAAGCAGG TTTTTTTCGA TAAAGTGGGG CTCTTCATGC AGCGCGCCGG CAGCCGGAAA
ACTGCTCTGG TATTTGTTCA CGGTTTCAAC ATAAGTTTCG AGGCCGCCAC ACTGCGCATG
GCCCAGATGA CCTCCGATCT CGATTTCAGA GGCACACCGC TGGTCTACAG CTGGCCGTCG
GACGCTTCGC TCGGTTCATA TCGCGAGGAC GAACGGAGCG TTGTCGAAAC CGAAGGCAAT
CTTTACCGTT TTCTTTGCGG TATAGCCGAG CGTTCCGGAA AGGCAGGCAT CTATCTGCTT
GCCCACAGCA TGGGAACCCG TGCCCTGACC TCGGCTTTCA TCATGCTTGC AAAAGAGCGC
CCCGAACTGC TTTCCCGTTT CGGTGCCATC GTGCTTGCCG CGCCGGATAT CAATGCGGAA
CGCTTCAGAC GTGAACTTGC GCCATCCCTC GCAGGCAACG GGGTGCCGGT AACGGTTTAC
GCTTCGCGTT CGGACAATGC GCTCAGGGTC TCCGAAAATG TCAACGGCAA CCCGAGGGCC
GGTGAAGTCG CAGATATACC GCTTATCGTG CCCGGCATTG AAACCATCGA TGCCACCGAT
GTCGACAGCG ATCTTCTTGG CCATTCCTAT TACAACCGCT CCAGAACGGT GCTTTCGGAC
ATGTTTTATA TCATCAGCAG AGGACTTCCC GCCTCGGAGC GTTTTTCTCT CCAGCCGGTC
GATACCGCGG CGGGGAGGTA CTGGAGGTTC CGTAAATAG
 
Protein sequence
MIHTFRTVFA VLLLALELAG CTASFQAVQQ RPVLSLFYAT DRAMSGSSEP GEFYNSDHAP 
LQYGTCTVSV PQKHRIAELE RPVLSMHPER HFELLSIDTL DKQVFFDKVG LFMQRAGSRK
TALVFVHGFN ISFEAATLRM AQMTSDLDFR GTPLVYSWPS DASLGSYRED ERSVVETEGN
LYRFLCGIAE RSGKAGIYLL AHSMGTRALT SAFIMLAKER PELLSRFGAI VLAAPDINAE
RFRRELAPSL AGNGVPVTVY ASRSDNALRV SENVNGNPRA GEVADIPLIV PGIETIDATD
VDSDLLGHSY YNRSRTVLSD MFYIISRGLP ASERFSLQPV DTAAGRYWRF RK