Gene Rleg_1679 details


Gene Information

Locus tag: Rleg_1679
Symbol:
ID: 8012748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1672926
End bp: 1674143
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 63%
IMG OID: 644824266
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_002975505
Protein GI: 241204409
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
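
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1672926, 1674143)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.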


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.451018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCG ATACGCGTGC TTTGGGTTTC GGCAGCAGTG AGAGGGCGGT CTTTGCGGCC 
GACCCCTGGA CGACGCGCGG GCGGCTCTAT CAGGAGGACG GTAGCCCGAC GCGCTCCGAT
TTCCAGCGCG ACCGCGACCG CATCGTCCAC ACCACCGCCT TCCGCCGGCT GAAGCACAAG
ACCCAGGTCT TCATCGCCCA GGACGGCGAT CATTACCGAA CCCGGCTAAC GCACACGATC
GAGGTGGCGC AGATCGCTCG CGCGCTTGCC CGCGCCCTGA AGCTCGACGA GGATCTGGCC
GAAGGCGTGG CGCTCGTGCA CGATTTCGGC CATACGCCGT TCGGTCATAC TGGCGAGGAC
GCGCTGCACG AAGTGCTGCT GCCCTATGGC GGCTTCGACC ACAATGCCCA ATCGCTGCGC
ATCGTGACCA AACTGGAGAG GCGTTATGCC GAATTCGACG GCATCAACCT GACATGGGAA
AGCCTCGAAG GTCTCGTCAA GCACAATGGC CCGCTTCTGA CGCCGGATGG AGTAGGCACG
CGCGGTCCCG TGCCGCAGCC GATCCTCGAT TATTGCGAGC TGCACGATCT CGAGCTTGCA
ACCTATGCCA GCCTTGAGGC CCAGGTCGCG GCGATCGCCG ACGACATCGC CTACAATACC
CACGATATCG ACGACGGCCT GCGCTCCGGC TACTTGACTT TCGATATGCT GGAGGAAATC
CCGTTTCTTG CCGGGCTGAT GGCCGAGGTG AGGGCGCGAT ATCCGCATCT GGAGCCGAGC
CGCTTCACCC ATGAGATCAT GCGCCGCCAG ATTACCCGCA TGGTTGAAGA CGTGATCGGC
GTCGCGCAGC AGCGCCTGTC GCTGCTGCGC CCTGAGAGCG CTGCCGACAT CCGCGCTGCC
GGCCAGGTTA TCGCCACCTT TTCCGAGGGG ATGGCCGAGA CCGACAGGCA GATCAAGGCG
ATGCTCTTCA AACGCATCTA CCGCAATCCC GATATCATGC GCATCCGCGC CGGTGCCGCC
CAGATCGTCA CCGATCTCTT TGCCGCCTAC ATGGCCAATC CCAAGGAGAT GCAGAGCCAT
TACTGGGTAG ATCATATCGC CGGCCTGGCC GATGCGCCGA AGGCCCGCCA TGTCGGCGAT
TATCTCGCCG GCATGACCGA TACCTATGCG ATCAGCGCCC ATAGGCGGTT GTTTGACCAC
ACTCCGGATT TGCGATAG
 
Protein sequence
MTIDTRALGF GSSERAVFAA DPWTTRGRLY QEDGSPTRSD FQRDRDRIVH TTAFRRLKHK 
TQVFIAQDGD HYRTRLTHTI EVAQIARALA RALKLDEDLA EGVALVHDFG HTPFGHTGED
ALHEVLLPYG GFDHNAQSLR IVTKLERRYA EFDGINLTWE SLEGLVKHNG PLLTPDGVGT
RGPVPQPILD YCELHDLELA TYASLEAQVA AIADDIAYNT HDIDDGLRSG YLTFDMLEEI
PFLAGLMAEV RARYPHLEPS RFTHEIMRRQ ITRMVEDVIG VAQQRLSLLR PESAADIRAA
GQVIATFSEG MAETDRQIKA MLFKRIYRNP DIMRIRAGAA QIVTDLFAAY MANPKEMQSH
YWVDHIAGLA DAPKARHVGD YLAGMTDTYA ISAHRRLFDH TPDLR
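
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might clean a sequence, estimate its GC fraction, and translate it for comparison with the listed protein. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; alternative start codons are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard assignments,
# which NCBI translation table 11 also uses for amino acids).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon; a trailing partial codon is ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First two blocks of the gene sequence listed above.
print(translate("ATGACGATCG ATACGCGTGC"))  # MTIDTR
```

Passing the full gene sequence to `translate` and dropping the final `*` should, under these assumptions, reproduce the protein sequence; `gc_fraction` on the same input can be compared with the GC content field.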