Gene Clim_0054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0054 
Symbol 
ID6355577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp60329 
End bp61579 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content52% 
IMG OID642667678 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_001942140 
Protein GI189345611 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.549481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTGC TGCCTGACAG TTCGGTTCAC CTTGTCATTA CCTCTCCTCC CTACTGGCAG 
CTCAAGGACT ACGGGACGGA GAACCAGATC GGATTTCACG ACAGCTATGA GAGCTACATC
AACAATCTGA ATCTTGTCTG GAGCGAGTGC GAACGGGTGC TGCATCCCGG CTGCAGGCTC
TGCATCAACA TCGGCGACCA GTTCGCCCGT TCGGTGTATT ACGGCCGGTA CAAGGTTATC
CCGATCCGGA CGGAGATCAT CAGGTTCTGC GAGACTATCG GTTTCGACTA CATGGGCGCG
GTGATCTGGC AGAAGGTGAC CACAACCAAC ACCACTGGCG GGGCATCCAT TATGGGAAGC
TTCCCGTATC CGCGCAACGG CATTCTCAAG CTCGATTATG AGTTCATTCT CCTGTTCAAA
AAGCCGGGAG ATGCGCCCAA GCCGGCAAAA GAGCAGAAAG AGCGCTCCGC CATGAGCACC
GAAGAGTGGA ACACCTGTTT CTCCGGACAC TGGAACTTTG CCGGAGCAAA GCAGGATGGC
CACATCGCCG TGTTTCCGGA AGAGCTTCCG CATCGCCTGA TCAGGATGTT CGCATTCAGC
GGAGAAACGG TGCTCGATCC GTTCATGGGC AGCGGGACTA CCAGTCTTGC GGCAAAAAAC
CTCGACAGGA ACTCGGTCGG CTACGAAATC AATCCCGAGT TTATCGGAAT AGCAAAAGAG
AAACTCCGTG CCAACCAGAC GGACTTTGCC GGAACGGAGT ATATTTTTCA GCACGATGTC
CTGAAGGGGG ATATTTCCGA AATGATCGAG CGTCTTCCTT ATCGTTTTCA AGACCCCCAC
AAACTCGACA AGAAAATCGA CCCACGAAAG CTGACGTTCG GATCAAGAGT AGAAAAGGGT
AGCGGGGCAA AACAGGAAGA GACGTTTATC GTCAGGGAGA TTCTAAGCCC CGAGATGGTC
AGATTGTCCA ACGGCCTGAC GGTGAGGCTG ATCGGAGTGA AGGAAGAACC TTTTACGCGG
GAAAAAGCTG TCGGGTATCT CGTTGACAAG ATCAAGGGAA AACGGATTTT CATGAAGTAC
GACAGCATGA AATACGATGG GGGGGACAAT TTGCTCTGTT ACCTCTATCT GGAAAACAAG
ACGTTTGTCA ACGCGCATCT GATCAAGAGC GGTTTAGTCG GGATTGACGG CAGCTACGAT
TATAAATACC GGAGCAAATT TCAAACTTTT TCCGAACAGG TCAATGGCTA A
 
Protein sequence
MNLLPDSSVH LVITSPPYWQ LKDYGTENQI GFHDSYESYI NNLNLVWSEC ERVLHPGCRL 
CINIGDQFAR SVYYGRYKVI PIRTEIIRFC ETIGFDYMGA VIWQKVTTTN TTGGASIMGS
FPYPRNGILK LDYEFILLFK KPGDAPKPAK EQKERSAMST EEWNTCFSGH WNFAGAKQDG
HIAVFPEELP HRLIRMFAFS GETVLDPFMG SGTTSLAAKN LDRNSVGYEI NPEFIGIAKE
KLRANQTDFA GTEYIFQHDV LKGDISEMIE RLPYRFQDPH KLDKKIDPRK LTFGSRVEKG
SGAKQEETFI VREILSPEMV RLSNGLTVRL IGVKEEPFTR EKAVGYLVDK IKGKRIFMKY
DSMKYDGGDN LLCYLYLENK TFVNAHLIKS GLVGIDGSYD YKYRSKFQTF SEQVNG