Gene Rleg_5603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5603 
Symbol 
ID8016829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp183650 
End bp186949 
Gene Length3300 bp 
Protein Length1099 aa 
Translation table11 
GC content63% 
IMG OID644827769 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_002978969 
Protein GI241518341 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00446711 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGATACG CCGAGCTCCA GGTCACGACG CATTTCAGCT TCCTTCGTGG CGCGTCCTCC 
GCCGACGAGC TCTTTTCTAC CGCCAGGGAA CTGGCCATCG AGGCTCTCGG CGTTGTCGAT
CGCAACAGCC TGGCAGGGGT CGTCCGGGCG CTCGAGGCGT CGCGCGCGAC AGGCGTCCGT
CTCGTTGTTG GGTGTCGGCT GGATCTGCAG GAGGGCATGT CGATTCTCCT TTATCCCACG
GACCGCGGCG CCTATTCGCG GCTCACCCGG CTCCTCACGC TTGGCAAGGG CAGGGGCGGC
AAGGCGAACT GCATCTTGAA CCTCGACGAT GTCGCTCTCT ACTCCGAGGG GCTGCTTGCA
ATCCTTGTGC CCGACCTAGC TGACGAGACC TGCGCCGTCC AACTTCGCAA GATGGCCGAG
ATATTCGAGG ACCGAGCATA TGTCTCGCTT TGCCTCCGGC GGCGTCCGAA CGACCAGTTG
AGGCTGCACG AATTGTCGAA CATGGCGATG AGGCACCGCG TGCGAACAGT CGTCACCAAC
GACGTCCTCT TCCACGATCC TTCCAGGCGG CAGCTTCAGG ACGTCGTCAC CTGCATTCGA
AACAATACCA CCATCGACGA CGTCGGCTTC AAGCGCGAGC GCCACGCGGA CAGATATCTA
AAGCCGCCGG AGGAGATGGA GCGCCTGTTC CCGCGCTACC CCGAGGCGCT GGCCCGGACG
ATGGAGATCG TCGACCGCTG CAGGTTCTCG CTGGAGGAAC TGACCTACCA GTATCCCGAG
GAGGCGATCC TCCCGGACAA GACCCCGCAG GAATCCCTCG AGCACTATGT CTGGGAGTGC
GTGCCCAACC GCTATCCGGA AGGGCTGCCG CCGGAGGTTC TCAAGATCGT CCGTCACGAA
CTCGATCTCA TCCGGACGAT GAAATACGCC CCATACTTCC TGACCGTCTT CTCGATCGTC
CGCTTCGCCC GGTCGCAAGG CATCCTTTGC CAGGGGCGGG GCTCGGCGGC GAACTCGGCC
GTCTGTTACA TCCTTGGTAT CACCAGCATC GACCCCTCGA CCAACGATCT CCTGTTCGAG
CGCTTCGTCT CCCAGGAGCG CGACGAGCCG CCGGACATCG ACGTCGACTT CGAACACGAA
AGAAGGGAGG AGGTCATCCA ATGGATCTAC AAGACCTACG GGAAGGATAA GGCCGCTCTC
TGTTCGACGG TGACCCGTTA CAGGGCGAAG GGTGCCATCC GCGACGTCGG CAAGGCGCTC
GGCTTGCCTG AGGACCTCAT CAAGGCCCTG TCAACGGGCA TGTGGGCGTG GTCGGAAGAG
CTGGTCAGTG ATCGCAGTCT TCGTGATCAG GGCTTGAACC CGCAAGACCG CCGCCTCGCC
CTGACGCTTA GGCTTGCGCA GCAGCTCATG GGCGCTCCTC GGCATCTGGG ACAGCATCCA
GGCGGCTTCG TCCTGACCCA CGACCGCCTC GACGATCTCG TTCCGATCGA ACCCGCGGCA
ATGGTTGACC GGCAAGTGAT CGAGTGGGAC AAGGACGACG TCGAGGCGCT CAAATTCATG
AAGGTCGACA TCCTGGCGCT CGGTATGCTG ACCTGCATGG CGAAGGCCTT CGCGCTCATC
GAAGAACACA AGGACGAGCA CCTTGATCTC GCCACCATCC CTCAGGAGGA CCAGGCGACA
TACGCGATGA TCAGGAAAGC CGACACCCTC GGCACGTTCC AGATCGAGTC GAGGGCGCAA
ATGGCGATGT TGCCGCGCCT GAAGCCGCGG ACCTTCTATG ACCTCGTGAT CCAGGTCGCG
ATCGTCCGGC CTGGTCCGAT TCAGGGCGAT ATGGTGCACC CCTATCTCCG CCGCCGGGAG
GGCAAGGAGC GGGTCGTCTA CCCGACGCCG GAACTCGAAG CCGTTCTCGG CAAGACACTC
GGCGTCCCGC TCTTCCAGGA GTCGGCCATG AAGGTGGCGA TGGTTTGCGC CGGATTTACC
GGCGGCGAGG CAGACCAGCT TCGCAAGTCG ATGGCCACCT TCAAGTTCAC CGGCGGCGTC
TCGCGCTTCA AGGACAAGCT CGTCTCCGGC ATGATCAGGA ACGGCTACAC GGCCGAGTTT
GCCGAGAAGA CCTTCAGCCA GCTCGAGGGA TTCGGCAGTT ATGGCTTCCC GGAATCCCAT
GCGGCGTCGT TCGCCTTGAT CGCCTATGCC TCCAACTATG TGAAATGCCA CTGTCCGGAC
GTCTTTTGCG CGGCGCTGCT GAACTCGCAG CCGATGGGAT TTTACGCGGC GGCCCAGATC
GTCGGTTGCG CTCGGAACCA TGGCGTCGAG ATCCGACCGA TCTGCATCAA CAATTCCCGG
TGGGACTGTA CGCTTGAGCG TATCGGAGAT ACCGACCATC ACGCGGTCCG CCTGGGAATG
CGCATGGTGC GCGGCCTTGC CGCCGCCGAT GCCGCGCGTG TCGCGGCTGC TCGCATGGAC
CAACCGTTCG AAAGCGTTGA CGATATGTGG CGACGGTCTG GGGTCCCGGC GGCTTCTCTG
GTCGAACTCG CCGAGGCCGA CGCCTTCCTG CCATCTCTGG GACTTCAGCG GCGCGACGCA
TTGTGGGCGA TTAAGGCTCT GCGTGACGAA CCTCTTCCGC TCTTTGCAGC CGCCTCGGAA
CGCGAGGCGA GGGCGATCGC CGAGCAGCAG GAACCCGAAG TCGCTCTGCG CCAGATGACG
GATGGACACA ACGTTGTCCA GGATTACAGT CACATCGGGT TGACGCTCCG CCAACATCCC
GTCGCATTCC TTCGAAAAGC TTTGGCAGAG CGCCAGATCG TCACCTGCGC GCAAGCCATG
AACGCGCGGG ATGGTCGCTG GTTGATGACA GCTGGCCTGG TCCTCGTCCG GCAGAGACCC
GGCAGCGCCA AGGGCGTCAT CTTCATGACG ATCGAGGACG AGACCGGTCC GGCCAACGTC
GTCGTCTGGC CCAAGCTCTT CGAGCAGCGC CGCCGTATCA TCCTCGGGGC ATCGATGATC
GCGATCAATG GGAGAATTCA GCGTGAAGGC GACGTCGTTC ATCTCGTGGC CCAGCAAGCC
TTCGATCTGT CGGGCGATCT CTCCGGGCTG GCTGAGCGCG ACGCAGGGTT CCGGCTCCCG
ACGGGCAGGG GAGACGAGTT CGCGCATGGA TCGCCCGGAA GTCCGGATTC GCGCGACAGG
GTGGCGGGCG CGAGGCCGAG GGACATTTTC ATTCCGTTAT GCCGAACTCC GCATAAGGGA
ACTTATCCCG AGCCGGAGAC TATGCCGAGC CCGTTTCCCA AAGCCCGAGA TTTCCGGTGA
 
Protein sequence
MRYAELQVTT HFSFLRGASS ADELFSTARE LAIEALGVVD RNSLAGVVRA LEASRATGVR 
LVVGCRLDLQ EGMSILLYPT DRGAYSRLTR LLTLGKGRGG KANCILNLDD VALYSEGLLA
ILVPDLADET CAVQLRKMAE IFEDRAYVSL CLRRRPNDQL RLHELSNMAM RHRVRTVVTN
DVLFHDPSRR QLQDVVTCIR NNTTIDDVGF KRERHADRYL KPPEEMERLF PRYPEALART
MEIVDRCRFS LEELTYQYPE EAILPDKTPQ ESLEHYVWEC VPNRYPEGLP PEVLKIVRHE
LDLIRTMKYA PYFLTVFSIV RFARSQGILC QGRGSAANSA VCYILGITSI DPSTNDLLFE
RFVSQERDEP PDIDVDFEHE RREEVIQWIY KTYGKDKAAL CSTVTRYRAK GAIRDVGKAL
GLPEDLIKAL STGMWAWSEE LVSDRSLRDQ GLNPQDRRLA LTLRLAQQLM GAPRHLGQHP
GGFVLTHDRL DDLVPIEPAA MVDRQVIEWD KDDVEALKFM KVDILALGML TCMAKAFALI
EEHKDEHLDL ATIPQEDQAT YAMIRKADTL GTFQIESRAQ MAMLPRLKPR TFYDLVIQVA
IVRPGPIQGD MVHPYLRRRE GKERVVYPTP ELEAVLGKTL GVPLFQESAM KVAMVCAGFT
GGEADQLRKS MATFKFTGGV SRFKDKLVSG MIRNGYTAEF AEKTFSQLEG FGSYGFPESH
AASFALIAYA SNYVKCHCPD VFCAALLNSQ PMGFYAAAQI VGCARNHGVE IRPICINNSR
WDCTLERIGD TDHHAVRLGM RMVRGLAAAD AARVAAARMD QPFESVDDMW RRSGVPAASL
VELAEADAFL PSLGLQRRDA LWAIKALRDE PLPLFAAASE REARAIAEQQ EPEVALRQMT
DGHNVVQDYS HIGLTLRQHP VAFLRKALAE RQIVTCAQAM NARDGRWLMT AGLVLVRQRP
GSAKGVIFMT IEDETGPANV VVWPKLFEQR RRIILGASMI AINGRIQREG DVVHLVAQQA
FDLSGDLSGL AERDAGFRLP TGRGDEFAHG SPGSPDSRDR VAGARPRDIF IPLCRTPHKG
TYPEPETMPS PFPKARDFR