Gene Rleg_0001 details


Gene Information

Locus tag: Rleg_0001
Symbol: dnaA
ID: 8015384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 55
End bp: 1605
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 62%
IMG OID: 644822592
Product: chromosomal replication initiation protein
Protein accession: YP_002973852
Protein GI: 241202756
COG category: [L] Replication, recombination and repair
COG ID: [COG0593] ATPase involved in DNA replication initiation
TIGRFAM ID: [TIGR00362] chromosomal replication initiator protein DnaA
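
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(55, 1605)    # 1551
residues = protein_length_aa(length)  # 516
```

Both results agree with the Gene Length and Protein Length fields of this record.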


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.962631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00495499
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGATGA ATACGATGAC GACGAGCGGG CTCGACAATG GGGATGCGGC ACCGCAGGCG 
TTCGGCTCCA TTCGCCTGGA AGCGGCGGAA GTAAAGGCGG ATATGAAGCA GAACGTATTG
TTTGAGCGCG TCACCGCGCG CTTGAAGGCT CAGGTCGGTC AGGATGTCTA CGCCAGCTGG
TTCGCCCGGC TGAAGCTGCA TTCGGTATCG AAGAGCGTCG TTCGCCTTTC GGTCCCCACG
ACCTTCCTGA AGTCGTGGAT CAACAATCGT TATCTCGATC TCATCACCGG TCTGTTCCAG
GCCGAAGATC CGGAAATTCT GAAAATCGAA GTCCTGGTGC GTACGGCGAC GCGCCACGGC
ACGAAGGCGC TCGATGAGGC GGTCGCGCCG GAACCAGCCG CCCCTACGCA GATGCGCCGC
CCGGCAAGCG CTCAGCCGGC CGGTCAGGCC GTCCAGCAGG CGGTTTCGGC CGTTGCCGCC
GCAAGGCCCG CAAGCTTCGG CTCGCCGCTC TTCGGTTCGC CGCTCGATAG CCGCTTTACC
TTCGACACCT TCGTCGAAGG CAGCTCGAAC CGGGTAGCAC TTGCGGCTGC AAAGACGATC
GCGGAAGCCG GTCAGGGCGC CGTGCGCTTC AACCCGCTCT TCATCCATTC GACCGTCGGC
CTCGGCAAGA CCCACCTGCT GCAGGCTGTC GCCAATGCGG CAGTGCAGAA CCCCAGGGCT
CTGCGCGTCG TCTATCTGAC GGCCGAATAT TTCATGTGGC GTTTCGCCAC CGCGATCCGC
GACAATGATG CGCTGACGCT GAAGGATTCG CTGCGCAACA TCGATCTCTT GATCATCGAC
GACATGCAGT TCCTGCAGGG CAAGATGATC CAGCATGAAT TCTGCCATCT CCTCAACATG
CTTCTCGACA GCGCCAAGCA GGTCGTCGTT GCCGCCGACC GTGCGCCCTG GGAGCTGGAG
TCGCTCGACC CCCGCGTTCG CTCGCGCCTC CAGGGCGGCG TCGCGATCGA ATTCGACGCG
CCGGATTACG AGATGCGTCT CGAAATCCTC AAGCGTCGCC TTGCTGTCGC CCGGCTCGAA
GATCCGTCGC TCGAAATTCC GGCCGAGTTG CTCCAGCATG TCGCTCGCAA CGTCACGGCC
AGCGGCCGCG AACTTGAAGG CGCTTTCAAC CAGCTGGTCT TCCGCCGCTC CTTCGAGCCG
AACCTGTCGA TCGAACGCGT CGACGAACTG CTCGCCCATC TGGTCGGCTC CGGCGAACCC
CGCCGTGTGC GCATCGAGGA TATCCAGCGC ATCGTTGCAA GACACTACAA TGTCTCGCGC
CAGGAACTGG TGTCGAACCG CCGCACCCGC GTCATCGTCA AGCCGCGCCA GATCGCCATG
TATCTGTCGA AGACGCTGAC GCCACGCTCC TTCCCGGAGA TCGGCCGCCG TTTCGGCGGG
CGTGATCACA CGACCGTGCT GCACGCCGTG CGCAAGATCG AGGAACTAAT TTCGGGAGAC
ACCAAGCTTT CGCACGAAGT CGAGCTTCTG AAGCGCCTGA TCAACGAATA G
 
Protein sequence
MQMNTMTTSG LDNGDAAPQA FGSIRLEAAE VKADMKQNVL FERVTARLKA QVGQDVYASW 
FARLKLHSVS KSVVRLSVPT TFLKSWINNR YLDLITGLFQ AEDPEILKIE VLVRTATRHG
TKALDEAVAP EPAAPTQMRR PASAQPAGQA VQQAVSAVAA ARPASFGSPL FGSPLDSRFT
FDTFVEGSSN RVALAAAKTI AEAGQGAVRF NPLFIHSTVG LGKTHLLQAV ANAAVQNPRA
LRVVYLTAEY FMWRFATAIR DNDALTLKDS LRNIDLLIID DMQFLQGKMI QHEFCHLLNM
LLDSAKQVVV AADRAPWELE SLDPRVRSRL QGGVAIEFDA PDYEMRLEIL KRRLAVARLE
DPSLEIPAEL LQHVARNVTA SGRELEGAFN QLVFRRSFEP NLSIERVDEL LAHLVGSGEP
RRVRIEDIQR IVARHYNVSR QELVSNRRTR VIVKPRQIAM YLSKTLTPRS FPEIGRRFGG
RDHTTVLHAV RKIEELISGD TKLSHEVELL KRLINE
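
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch, assuming plain uppercase A/C/G/T input, of how one might clean such a block, compute its GC percentage, and translate it with the amino-acid assignments of translation table 11 (which are the same as those of the standard code). All function and constant names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    the gene above starts with ATG, so that does not matter for it.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = clean_sequence(
    "ATGCAGATGA ATACGATGAC GACGAGCGGG CTCGACAATG GGGATGCGGC ACCGCAGGCG"
)
peptide = translate(first_line)  # "MQMNTMTTSGLDNGDAAPQA"
```

The translated peptide matches the first twenty residues of the protein sequence listed above. Applying the same functions to the full gene sequence would allow the Protein Length and GC content fields to be checked.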