Gene Shewmr7_1933 details

Gene Information

Locus tag: Shewmr7_1933
Symbol:
ID: 4258461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-7
Kingdom: Bacteria
Replicon accession: NC_008322
Strand:
Start bp: 2284298
End bp: 2285638
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 52%
IMG OID: 638122596
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_737979
Protein GI: 114047429
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
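
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2284298, 2285638)
print(length, protein_length_aa(length))  # prints "1341 446"
```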


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00212901
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0779588
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTCAA GTGTTTGGCA AGAACGCCGC CATGGCGAAG ATAAACAAAG ACGTAATGAT 
CATCGAAGCC CTTTCCAAAG GGACAGAGCA AGGATCCTCC ACTCAGCCGC CTTTCGCCGC
CTACAGGCCA AGACCCAAGT GCTTGGGGTG GGCATGAATG ACTTTTATCG CACACGCTTA
ACCCATTCAC TCGAAGTGTC ACAAATCGGC ACGGGCATCG CCGCGCAGCT CAGCCGCAAG
TATCCCGAGC ATAAGCCCTT ATTAGGCTCG ATGAGCCTAC TCGAATCCCT CTGTCTTGCC
CATGATATTG GCCATCCGCC CTTTGGCCAT GGCGGTGAAG TCGCGCTCAA CTATATGATG
CGCCACCACG GTGGCTTTGA AGGCAATGGC CAGACGTTTC GTATTCTCTC GAAACTCGAG
CCTTACACCG AGGCCTTTGG GATGAATCTG TGCCGCCGTA CTATGCTCGG CATTTTAAAA
TATCCCGCGC CGCAATCACT GCTGTTTGTG GCAGGTTCGC ATCCTGAAAT CACCAATCAC
AGACAGCTTA AACCATCACA ATGGCCGCCT GTTAAAGGCA TATTCGACGA TGATAGCGAC
ATTTTCGATT GGGTACTGGA ACCGTTATCC GTTGCCGATA GAGCGCGCTT TACCTCCGTT
CAGCCGAGCC TGCAGCCAAA CTACCCGCAT CTACGCACTC AATTTAAATC CTTCGATTGC
TCGATTATGG AACTGGCAGA CGACATCGCC TACGCAGTGC ACGATCTTGA AGATGCGATT
GTCATGGGCA TAGTCACCGC CTCGCAATGG CAACAGGATG TGGCGCCGAC ACTTAAGCAC
AGTGGCGATC CTTGGATCCG CCAGGAGCTT GCCGATATCG GCACTAAGCT CTTCTCCCAT
GAACATCATC TGCGAAAGGA TGCCATCGGC ACCTTAGTAA ATGGTTTTGT CACCGCCATT
ATTATCAACG ACGATCCGGC CTTCGAGGAA CCGTTGCTGC GGTTTAATGC CAGCCTCGAA
CCCGAATTTG CTAATGCGCT CAATGTGCTA AAGCAGTTAG TGTTTAAATA CGTTATCCGT
AAACCCGAGA TCCAAATGCT GGAATACAAG GGCCAACAGA TAGTGATGGG ACTCTTCGAA
GCGTTCGCCT CGGATCCGGA GCGGTTATTA CCACTCAATA CCCAAGAACG CTGGCGCACC
AGTGAGCAGC AAGGCCAAAA CAGCCACAGG GTGTTGGCAG ATTATATTTC TGGCATGACG
GATGAATTTG CCGGACGACT GTACCAGCAG TTGTTTAGCC CCAAGGCTGG CTCCAACGTG
GAACTCAGCA AAGAGATGTA G
 
Protein sequence
MSSSVWQERR HGEDKQRRND HRSPFQRDRA RILHSAAFRR LQAKTQVLGV GMNDFYRTRL 
THSLEVSQIG TGIAAQLSRK YPEHKPLLGS MSLLESLCLA HDIGHPPFGH GGEVALNYMM
RHHGGFEGNG QTFRILSKLE PYTEAFGMNL CRRTMLGILK YPAPQSLLFV AGSHPEITNH
RQLKPSQWPP VKGIFDDDSD IFDWVLEPLS VADRARFTSV QPSLQPNYPH LRTQFKSFDC
SIMELADDIA YAVHDLEDAI VMGIVTASQW QQDVAPTLKH SGDPWIRQEL ADIGTKLFSH
EHHLRKDAIG TLVNGFVTAI IINDDPAFEE PLLRFNASLE PEFANALNVL KQLVFKYVIR
KPEIQMLEYK GQQIVMGLFE AFASDPERLL PLNTQERWRT SEQQGQNSHR VLADYISGMT
DEFAGRLYQQ LFSPKAGSNV ELSKEM
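
As a quick cross-check, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the database's own pipeline: it builds the amino-acid assignments of translation table 11 by hand (they match the standard genetic code), and the helper names `clean` and `translate` are illustrative. Only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display sequences in blocks of 10."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence. The last line is in frame
# because the 1320 bases before it are a multiple of 3.
first_line = "ATGAGCTCAA GTGTTTGGCA AGAACGCCGC CATGGCGAAG ATAAACAAAG ACGTAATGAT"
last_line = "GAACTCAGCA AAGAGATGTA G"

print(translate(clean(first_line)))  # MSSSVWQERRHGEDKQRRND
print(translate(clean(last_line)))   # ELSKEM
```

Both outputs agree with the start and end of the protein sequence listed above.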