Gene Clim_0623 details

Gene Information

Locus tag: Clim_0623
Symbol:
ID: 6354071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 702236
End bp: 703942
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 54%
IMG OID: 642668254
Product: DNA repair protein RecN
Protein accession: YP_001942689
Protein GI: 189346160
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
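
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(702236, 703942)
print(length, protein_length(length))  # prints "1707 568"
```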


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.72171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAGCA GCCTTTACAT CAGAAACTTT GCCCTGATAC GAGAACTTAC CGTAGAGTTT 
TCCAGAGGCC TCTGCATCAT TACCGGCGAA ACCGGTGCCG GCAAATCGAT GCTTATCGGA
GCACTCAGCC TTGTGCTTGG AGAACGCTCC AGCAGCGACC TTGTCCGTTC AGGCGAAAAC
AAGGCCATTA TCGAAGCCAT GCTCTGCGGT CAGCTCCCTG AGCGGCTCGG TGCCCTGCTC
GAAGAGGCGG GAATTGAATG CACGAACGAC ACTCTTCTGC GCAGGGAAAT TTCCGTTTCG
GGGCAGTCAC GCTGTTTTAT CAATGACACA CCCTGCACGG CGGGAGTGCT GAAACAGGTC
GGAGAACTGC TCATAGACCT GCACGGTCAG CACGACCATC AGCTCCTGCT CAATGCGGCG
TCCCATGAGG GCATGCTCGA TGCATTTTCC GGATGTGCAT CGGAAAGCTC CGCTTACCGT
GATACGGTTT CCCGCCTCTC TTCACTCTAC CGGCGAAAGA GCGTGCTTGC CCTTCAGGCA
GCGGAAGCAA AAGAAAAAAA AGAGATGATG CAGTTCCAGT TCAACGAACT GAATGCCCTT
GACCTGAAAA ACGGTGAAGA GGAGGAACTG GAGAGTGAAA TAATCCTGCT CGAAAATGCA
GAAACGCTCT ACGGGCTTGG TTCGGAACTC GGGAATCTCC TCTACGAACA GGATCATTCG
GCATATGCAG CGCTCTCATC AGCCCGGCAT ATTCTGGAAA AACTTTCCGC CATAGACAAA
CGGTTCGAAA GCCGCCTTGA AGACGTCCTC TCGGCGGAAA ACATGGTTGA CGATCTCTAT
CGTTTTGTAA ACCGTTACAC TGCGGCCGTC GAATTCAACA GCGACCGGCT CGATACCATG
AGAACCCGTC AGCATCTGCT GCAGCGCACC CGAAAAAAAT ACGCCAAAAC CCTGTCCGAA
CTGATTTCCT GGAGAGATGA ACTGACCGCC GCCCTTGGCA TTGAAGAGTC GATTGCCGAA
GAAAATTCTC TTATCGACAC GGAGATCGGT TCGCTCCGGG AAAAACTCTC CGCTGCGGCG
GCATCCCTGT CTCAAAAACG GAAAAACGCG GCACGCCGAC TCGATGAAAC GCTGCAGCGG
GAGCTCTCGA TGCTCGGCAT TGCCAGCGCA CGGTTCAAAA CGGCTTTTAC GCCCGAAGAG
GATCCGGAAG GCGACATAAC GCTCGATGGA ATCCGCTACA AGGCTCTTGC GAACGGACAT
GAAAAGATCG AGTTCCTGTT TTCAGCCAAC ACCGGAGAAG AACTGAAACC ATTGGCAAGG
TCTGCCTCCG GAGGAGAAAT TTCCCGGGTA ATGCTCGCCC TGAAGAGCGC GCTTGCAGAA
TCTGCGGCAC TCCCTATTCT TGTATTTGAT GAAATCGATA CCGGCATCAG CGGCACAACG
GCCCTTGCCG TAGCATCCAG CCTCAAAAGG CTTTCGCGTC TGCATCAGAT CATCGCGATC
ACCCATCTCC CGCAGATTGC CGCCATGGCC GATCTGCACC TTTCGATCAG TAAAACGATC
GAAAACGGGA GAACCTCTGC CGGTGTGCTG CATCTTGATG AACCGGGACA CATCCGGGCC
GTTGCCGAAC TCATCAGCGG AAGAAACGTA TCCGAATCCT CCCTCAGACT TGCCGGTGAA
CTGATAGAGA GCGCAAAATC AATTTAG
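
The gene sequence above is printed in groups of 10 bases, 60 per line, so the whitespace has to be stripped before any computation. As a minimal sketch, a GC fraction comparable to the "GC content" field could be computed as follows; the helper names are hypothetical, and how the database itself rounds the value is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGCTCAGCA GCCTTTACAT CAGAAACTTT GCCCTGATAC GAGAACTTAC CGTAGAGTTT"
print(len(clean_sequence(first_line)))  # prints 60
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1707-base listing to gc_fraction gives the value for the whole gene.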
 
Protein sequence
MLSSLYIRNF ALIRELTVEF SRGLCIITGE TGAGKSMLIG ALSLVLGERS SSDLVRSGEN 
KAIIEAMLCG QLPERLGALL EEAGIECTND TLLRREISVS GQSRCFINDT PCTAGVLKQV
GELLIDLHGQ HDHQLLLNAA SHEGMLDAFS GCASESSAYR DTVSRLSSLY RRKSVLALQA
AEAKEKKEMM QFQFNELNAL DLKNGEEEEL ESEIILLENA ETLYGLGSEL GNLLYEQDHS
AYAALSSARH ILEKLSAIDK RFESRLEDVL SAENMVDDLY RFVNRYTAAV EFNSDRLDTM
RTRQHLLQRT RKKYAKTLSE LISWRDELTA ALGIEESIAE ENSLIDTEIG SLREKLSAAA
ASLSQKRKNA ARRLDETLQR ELSMLGIASA RFKTAFTPEE DPEGDITLDG IRYKALANGH
EKIEFLFSAN TGEELKPLAR SASGGEISRV MLALKSALAE SAALPILVFD EIDTGISGTT
ALAVASSLKR LSRLHQIIAI THLPQIAAMA DLHLSISKTI ENGRTSAGVL HLDEPGHIRA
VAELISGRNV SESSLRLAGE LIESAKSI
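
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI base ordering (TCAG); it assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. The function name translate is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA listing, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 27 bases of the gene sequence from this record.
print(translate("ATGCTCAGCA GCCTTTACAT CAGAAACTTT"))  # prints MLSSLYIRNF
print(translate("CTGATAGAGA GCGCAAAATC AATTTAG"))     # prints LIESAKSI
```

Both fragments match the start and end of the protein sequence listed above.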