Gene Clim_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1017 
Symbol 
ID6355466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1111615 
End bp1112652 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content61% 
IMG OID642668640 
ProductRadical SAM domain protein 
Protein accessionYP_001943071 
Protein GI189346542 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTAT CCATGGTATC CTCCCTGATG CTCGTGGTGA CGACGGCATG CAATCTCTCC 
TGCCGCTACT GCTATGAAGG AGGTCGCCGT TCCGGGGAGT TCATGAGCCT CGATACGGCC
CTCTGTGCGC TCGACGTGGC GGCCCGGAGG GGGAGGCCCT TCCATGTACA GTTCACAGGG
GGGGAGCCGC TGCTTGCAGC AGATCTTGTC TTCGCCGTTC TCGAACATAT CGCCGCCGAG
GCTCTGCCGG CGACGACGGC CATCCAGACA AACGGCATAT TGCTCAACCG CGACGCCGTG
CGAAAGTTCA GGGCGCACAG GACCGCAGTA GGCATAAGCG TGGACGGTCT GCCGGGAATA
CAGGAGCGGA TGCGGGGCCA GAGTGCGGCA ACCTACAGGG CCATGCGGAT ACTCGATGAC
GAAGGGGTCC CCTTCAGCGT CACCACGGTG CTTTCCGCCG TGAATACCGG AGAGCTTGCA
AAGCTTGCCA TGGCCCTGCA CTCCTGGCCA ACGGCTTCGG CTATCGGACT CGACCTGCTG
GTGCGCAAAG GCTCTGCATC TCCGGGAAGC GGGATCGAAC CGCCTGAAGA GGCGCTGTTG
CGCCAGGGCA TAGGGGGGCT GCTCGGCACC CTCGACCTGC TGAACCGTGA ACGAAGGCAT
CCTCTTGTCC TTCGGGAAAA ACAGCTGGTA CAGAGGGCCT TGAAAAACGC CGTTACGGCA
GCACCCTACT GTTCCGCCTG CACCGGAGCG AGCCTCGCCA TTACGCCCGG AGGGGAGCTC
TACCCCTGCA CCCAGACCAT GGGGGATCCT GATTTTTTTC TCGGAACGCT CGCCCGCCCC
GACATGTCGC CTTCCCGAAC CTTTGCCGGA GAGTCTCCGG TCAGGGAAGG GTGCTCCGGC
TGCGTGCTCG ATGGGCGCTG TCCGGGTGAC TGCCCGTCCC GGATGCATTA CAACAGAGGG
AACCAGTGCG ACCTCGTCTG TACGCTCTAT CGAACCATCT ACGATTACTG CAAGCAAACA
GGAGAAATTC CATCATGA
 
Protein sequence
MMLSMVSSLM LVVTTACNLS CRYCYEGGRR SGEFMSLDTA LCALDVAARR GRPFHVQFTG 
GEPLLAADLV FAVLEHIAAE ALPATTAIQT NGILLNRDAV RKFRAHRTAV GISVDGLPGI
QERMRGQSAA TYRAMRILDD EGVPFSVTTV LSAVNTGELA KLAMALHSWP TASAIGLDLL
VRKGSASPGS GIEPPEEALL RQGIGGLLGT LDLLNRERRH PLVLREKQLV QRALKNAVTA
APYCSACTGA SLAITPGGEL YPCTQTMGDP DFFLGTLARP DMSPSRTFAG ESPVREGCSG
CVLDGRCPGD CPSRMHYNRG NQCDLVCTLY RTIYDYCKQT GEIPS