Gene Shewana3_2147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2147 
Symbol 
ID4478343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2571878 
End bp2573218 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content51% 
IMG OID639726735 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_869783 
Protein GI117920591 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000342411 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0447313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCAA GTGTTTGGCA AGAACGCCGC CATGGCGAAG ATAAACAAAG ACGTAATGAT 
CATCGAAGCC CTTTCCAAAG GGACAGAGCA AGGATCCTCC ACTCCGCAGC TTTTCGCCGC
CTACAGGCCA AAACCCAAGT GCTTGGGGTA GGCATGAACG ATTTTTATCG CACGCGCTTA
ACCCATTCAC TCGAAGTGTC TCAAATCGGC ACTGGCATTG CGGCGCAGCT CAGCCGCAAG
TATCCCGAGC ATAAGCCCTT ATTAGGCTCG ATGAGCCTGC TCGAATCCCT CTGTCTAGCC
CATGATATTG GCCATCCGCC CTTTGGTCAT GGCGGTGAAG TCGCACTCAA CTATATGATG
CGCCACCACG GCGGCTTTGA AGGCAATGGC CAGACGTTTC GTATCCTCTC GAAACTCGAG
CCTTACACCG AGGCCTTTGG GATGAATCTG TGCCGCCGTA CTATGCTCGG CATTTTAAAA
TATCCCGCAT CGCAATCACT GCTGTTTGTG GCAGGTTCGC ATCCTGAAAT CACCAATCAC
AGACAGCTTA AACCATCACA ATGGCCGCCT GTTAAAGGCA TATTCGACGA TGATAGCGAC
ATTTTCGATT GGGTACTGGA ACCGTTGTCC GTTGCCGATA GAGCGCGCTT TACCTCCGTT
CAACCGAGCC TACAGCCAAA CTACCCGCAT CTACGCACTC AGTTTAAATC CTTCGATTGC
TCGATAATGG AACTGGCGGA CGACATCGCC TACGCGGTGC ACGATCTTGA AGATGCGATT
GTCATGGGCA TAGTCACCGC CTCGCAATGG CAACAGGATG TGGCGCCGAC ACTTAAGCAC
AGTGGCGATC CTTGGATCCG CCAAGAGCTT GCCGATATCG GCACTAAGCT CTTCTCCCAC
GAACATCATC TGCGAAAGGA TGCCATCGGT ACCTTAGTAA ATGGTTTTGT CACCGCCATT
ATTATCAACG ACGATCCGGC TTTCGAGGAA CCGTTGCTGC GGTTTAATGC CAGCCTAGAA
CCCGAATTTG CTAATGCGCT CAATGTGCTA AAGCAGTTAG TGTTTAAATA CGTTATCCGT
AAACCTGAGA TCCAAATGCT GGAATACAAG GGCCAACAGA TAGTGATGGG ACTCTTCGAA
GCGTTCGCCT CGGATCCCGA GCGGTTATTA CCACTCAATA CCCAAGAACG TTGGCGCACC
AGTGAGCAGC AAGGTCAAAA CAGCCACAGG GTGTTGGCAG ATTATATTTC TGGCATGACG
GATGAATTTG CCGGAAGACT GTACCAGCAG TTATTTAGCC CCAAGGCCGG CTCGAACGTG
GAACTCAGCA AAGAGATGTA G
 
Protein sequence
MSSSVWQERR HGEDKQRRND HRSPFQRDRA RILHSAAFRR LQAKTQVLGV GMNDFYRTRL 
THSLEVSQIG TGIAAQLSRK YPEHKPLLGS MSLLESLCLA HDIGHPPFGH GGEVALNYMM
RHHGGFEGNG QTFRILSKLE PYTEAFGMNL CRRTMLGILK YPASQSLLFV AGSHPEITNH
RQLKPSQWPP VKGIFDDDSD IFDWVLEPLS VADRARFTSV QPSLQPNYPH LRTQFKSFDC
SIMELADDIA YAVHDLEDAI VMGIVTASQW QQDVAPTLKH SGDPWIRQEL ADIGTKLFSH
EHHLRKDAIG TLVNGFVTAI IINDDPAFEE PLLRFNASLE PEFANALNVL KQLVFKYVIR
KPEIQMLEYK GQQIVMGLFE AFASDPERLL PLNTQERWRT SEQQGQNSHR VLADYISGMT
DEFAGRLYQQ LFSPKAGSNV ELSKEM