Gene Clim_0314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0314 
Symbol 
ID6353831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp346905 
End bp348083 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content54% 
IMG OID642667943 
ProductDNA protecting protein DprA 
Protein accessionYP_001942387 
Protein GI189345858 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGC CACATGCAGG AAAAGATGCC GGGCTTTTCC TGTTGACGCT CGCCAGCGTC 
CCCGGACTCG GGCCTGCAAG AATAAATGCC ATCATAACCC GCTTCGGCTA TCAGCCCGAT
CTGCTCAGGG CATCTGCGGA TGTATTTCTT GAGGTTCCCG GCATAGGCCG GTCACTTGCG
GAAGAGATCT CCGGTTTTCT CAGCGGCAGT AAACGACGCG AAGCGGAAGA AGCTTCCCTT
CGACAGTATG AGGAGCTTGA TCGCCATCAG GCCTCGCTGG TCACGATCTT CGACCCATGT
TTTCCTGCCC TGCTCAAAGA AATTTATGAC CCGCCCCCTT TTCTGTTCGT TCGCGGCTCT
TTTTCCGAAC CGGAACCGCC ATCGATAGCC ATAGTAGGAA CCAGGCGAGC ATCAGCTTAT
GGAAAACAGG CAGCGGGCCT GCTCTCAGGC GAACTTGCTT CCCGGGGCCT GCTGATCGTC
AGCGGTCTGG CATATGGAAT CGATACCGCG GCGCATGAGG CCGCCATGAG GGCAGGAGGA
AAAACCATCG CGGTGCTTGC AGGCAGTGTC GACCATGTCT ATACCGATCC CAGGGGGAAA
ATCTGGCCGA AAATCATCGA ACAGGGTGCT CTCATTTCAG AAGAACTGTT CGGTTCCGAA
CTGCTCCCCG GAAAATTCCC CAAACGGAAC AGGATCATCT CGGGAATGTC GCTCGGCACT
GTTGTCGTTG AATCCGACCT GAAAGGTGGA GCGCTCATCA CGGCATCGTA CGCACTTGAA
CAGAACCGGG AGGTCTTCGC CGTACCGGGA ACCATATACT CGCACAATTC AAGGGGAACA
AACCGCCTGA TCCAGTCCGG GCAGGCAAAA ATGGTTCTTG CGACAGACGA TGTGCTTGAA
GAACTTAACC GTCCTTCCCT GAATATCCCG GTGCATGAGC ATGCTGCAAC CGATACCGTA
ACCATCGTCC TGTCGAAAGC AGAACGTGAG CTGCTGGCGT ACATGGATAC CGGACCGATA
CATATCGACG CCCTTGCCCT GCAGGCCGGG CATGATATTT CTGAATTGCT CGTTCTGTTA
TTCGAGCTTG AACTGAAGAA AGCCGTCGTC CAGCTCCCCG GCCAATTCTT TGGTAAAAAA
CAGATAAAAC ATGAAAAGAA TAGTCATTAT AGGCTCTAA
 
Protein sequence
MNEPHAGKDA GLFLLTLASV PGLGPARINA IITRFGYQPD LLRASADVFL EVPGIGRSLA 
EEISGFLSGS KRREAEEASL RQYEELDRHQ ASLVTIFDPC FPALLKEIYD PPPFLFVRGS
FSEPEPPSIA IVGTRRASAY GKQAAGLLSG ELASRGLLIV SGLAYGIDTA AHEAAMRAGG
KTIAVLAGSV DHVYTDPRGK IWPKIIEQGA LISEELFGSE LLPGKFPKRN RIISGMSLGT
VVVESDLKGG ALITASYALE QNREVFAVPG TIYSHNSRGT NRLIQSGQAK MVLATDDVLE
ELNRPSLNIP VHEHAATDTV TIVLSKAERE LLAYMDTGPI HIDALALQAG HDISELLVLL
FELELKKAVV QLPGQFFGKK QIKHEKNSHY RL