Gene Rleg_4549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4549 
SymbolxseA 
ID8015305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4676974 
End bp4678554 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content67% 
IMG OID644827126 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_002978326 
Protein GI241207230 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG TCTTCGACGG CGATTCGCCG ACCAACCTTG CCGAATATTC GGTTTCGGAA 
CTTTCCGGCT CGATCAAGCG CACCGTCGAA ACCGCCTTCG ACCAGGTTCG CGTGCGCGGC
GAAATATCAG GCTATCGTGG GCCGCACTCC TCGGGCCATG CCTATTTCGC GCTGAAGGAC
GATCGCGCCC GCATCGACGC TGTCATCTGG AAGGGCACCT TCTCACGACT GAAGTTCCGT
CCGGAAGAGG GCATGGAAGT CATCGCCACC GGCAAGGTCA CCACCTTTCC GGGTTCCTCG
AAATATCAGA TCGTCATCGA GACGCTGGAG CCGGCCGGCG CCGGCGCGCT GATGGCGCTG
ATCGAGGAGC GCAAGCGCAA GCTCGGCGCC GAGGGCCTGT TCGATGCCGC CCGCAAAAAG
CGGCTGCCCT TCATGCCCGG CGTAATCGGC GTCGTCACCT CGCCGACCGG CGCCGTCATC
CGCGATATCC TTCACCGCAT CTCCGATCGT TTTCCTGTGC ATGTCCTCGT CTGGCCGGTG
AAGGTCCAGG GCGAGGGCTC CGGCGAGGAG GTGGCGAACG CCATCCGCGG CTTCAACGCG
CTGGAACCTT CAGGCGCCAT CCCGCGCCCG GATGTGTTGA TCGTCGCACG CGGCGGCGGC
AGCCTGGAAG ATCTCTGGAG CTTCAACGAC GAAATCGTCG TGCGTGCTGC GGCCGAAAGC
CGGATACCGC TGATCTCGGC CGTCGGCCAT GAGACCGACT GGACGCTGAT CGACTACGCC
GCCGATGTCC GTGCGCCCAC GCCGACGGGG GCAGCGGAAA TGGCAGTGCC GGTCAAGGCG
GAGCTCGAGG CGCAGGCCGC CGCTCTTGCC GCGCGCCTGC AGGGCTGCAT GAACCGGCAG
ATGGATCAGC GCCGCCAGTC GGTGCGTGCT CTGATGCGGG CATTGCCGTC GCTCGATCAG
CTTCTCGCCT TGCCGCGCCG CCGTTTCGAC GAGGCGGCAA CCGGTCTCGG CCGCGGGCTG
GAGCTCAACA CTATCAACAA GCGCCGCGGC TTCGAGCGTG TCGCCGCGCA TCTGCGCCCC
GATCTGCTCG CCGGCCGCAT CGCCGAGCGC CGCCAGACGC TGAACGAGCG CATGGCCCGG
GCCGAGCGCA TGGTTGAGCG GCTGATCGAC CGTTCGAAAT CGCGCGTCGA CCGCGCCGAA
GCCATCCTCG CCTCACTGCC GGCCCGGCTG AAGACCCAGA CCGACCGCGG TCGCGAACGC
CTCGGCAATC TTTCGCGCCA TGCCGATACG GCGGTCCGCC ACCAGCTGAC CCGCGCGCGC
GCCGAACTTT CTTCGCAGGA CCGCGTGCTG CAATCGCTCT CCTACAAGAA TGTGCTGAAG
CGCGGCTATG CCGTCATTCG CGATGAGGAT AACAGGCCGG TCTCGCAGGC TGCTCAGCTC
TCCGCCGGCA TGGGCATCGC CATCGAATTC GCCGACGGCC GTGTCGGCGC CATGACCACG
GAAGGCGGCG CACCGCCGGC CGGGGCCAAG AAGCGCAGCG CAAGACCCGC AGACCCACCG
AAGCAGGGAA GCCTGTTCTG A
 
Protein sequence
MSNVFDGDSP TNLAEYSVSE LSGSIKRTVE TAFDQVRVRG EISGYRGPHS SGHAYFALKD 
DRARIDAVIW KGTFSRLKFR PEEGMEVIAT GKVTTFPGSS KYQIVIETLE PAGAGALMAL
IEERKRKLGA EGLFDAARKK RLPFMPGVIG VVTSPTGAVI RDILHRISDR FPVHVLVWPV
KVQGEGSGEE VANAIRGFNA LEPSGAIPRP DVLIVARGGG SLEDLWSFND EIVVRAAAES
RIPLISAVGH ETDWTLIDYA ADVRAPTPTG AAEMAVPVKA ELEAQAAALA ARLQGCMNRQ
MDQRRQSVRA LMRALPSLDQ LLALPRRRFD EAATGLGRGL ELNTINKRRG FERVAAHLRP
DLLAGRIAER RQTLNERMAR AERMVERLID RSKSRVDRAE AILASLPARL KTQTDRGRER
LGNLSRHADT AVRHQLTRAR AELSSQDRVL QSLSYKNVLK RGYAVIRDED NRPVSQAAQL
SAGMGIAIEF ADGRVGAMTT EGGAPPAGAK KRSARPADPP KQGSLF