Gene Smed_1157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1157 
Symbol 
ID5322003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1231597 
End bp1232814 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content61% 
IMG OID640790098 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_001326843 
Protein GI150396376 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.873553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.980751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTCG ACAGGCAGGC CCTTGGTTTC GGTTACGGTG AACATGCAGC GTATGCTTCG 
AACCCGTGGG CTTCGCGTGG CAGGCTCTAT CCCGAGGCGT CGAGCCCGAC GCGTTCGGAT
TTCCAGCGCG ACCGCGACAG GATCGTGCAT ACAACGGCAT TCCGGCGGTT GAAGCATAAG
ACGCAGGTCT TCATTGCCGC CGACGGTGAT CACTACCGCA CCCGGCTAAC CCATACGATC
GAAGTCGCCC AGATCGCGCG CGCACTCGCC AGAGCGTTGA ACCTGGACGA GGATCTCGCA
GAAGGCGTGG CGCTCGTCCA CGACTTCGGC CACACTCCCT TCGGGCACAC CGGCGAGGAC
GCGCTTGACG AGGTCCTGAA GCCCTATGGA GGGTTCGACC ATAACGCGCA GTCGCTGAGA
ATCGTCACCA AGCTGGAGCG GCGCTATGCG GAGTTCGATG GCCTCAATCT CACTTGGGAG
AGTCTCGAGG GGCTCGTCAA ACATAACGGC CCCCTGACGA CGGCGGACGG CCAGGGGCTT
CGCGGCCCGG TCTCGCAGCC GATCCTCGAC TACTGTGCTC TTCACGATCT CGAACTCGCG
AGCTTTGCAA GCCTCGAAGC GCAGGTTGCG GCAATCGCCG ATGATATCGC CTATAATACC
CACGATATCG ATGACGGTCT GCGCGCCGGC TATCTCACCT TCGAAATGCT GGAGGAGATA
CCGTTTCTCG CCCGGTTGAT GTACGAGGTT CGCGACCGCT ATCCGGGCCT TGAAAGCAGC
CGGTTCACGC ATGAGATCAT GCGGCGGCAG ATCACCGCCA TGGTGGAAGA CGTCATCGGC
GTTTCGCAGA GAGGGCTTGC GGACGTCCGG CCCGCAAGCG CAAGGGACGT GCGTTGCGCC
GGCAGGGTCA TCGCGACCTT CTCGGACGAA ATGAGCGAGA CGGACCGTCA GATCAAAAAT
CTGCTGATGA CGCGCATTTA CCGGCATCCG GAGGTCATGC GGGTACGAGA GGGAGCGGCA
TCGATCGTGA CGGACCTCTA CCGTGCCTTC ATGGACGATC CTTCGCTCAT GAAGGAACAC
TATTGGATCG ATCAGATCGC GGGGATGGAG GAGCCGGCCC GGGCCCGCCA TGTGGGGGAT
TATCTCGCCG GTATGACGGA TACTTTCGCG ATCAGCGTGC ATAGGCGTTT GTTTGACCAC
ACGCCCGATT TGCGCTAG
 
Protein sequence
MTVDRQALGF GYGEHAAYAS NPWASRGRLY PEASSPTRSD FQRDRDRIVH TTAFRRLKHK 
TQVFIAADGD HYRTRLTHTI EVAQIARALA RALNLDEDLA EGVALVHDFG HTPFGHTGED
ALDEVLKPYG GFDHNAQSLR IVTKLERRYA EFDGLNLTWE SLEGLVKHNG PLTTADGQGL
RGPVSQPILD YCALHDLELA SFASLEAQVA AIADDIAYNT HDIDDGLRAG YLTFEMLEEI
PFLARLMYEV RDRYPGLESS RFTHEIMRRQ ITAMVEDVIG VSQRGLADVR PASARDVRCA
GRVIATFSDE MSETDRQIKN LLMTRIYRHP EVMRVREGAA SIVTDLYRAF MDDPSLMKEH
YWIDQIAGME EPARARHVGD YLAGMTDTFA ISVHRRLFDH TPDLR