Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1566 |
Symbol | dgt |
ID | 5135809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1684351 |
End bp | 1685676 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640533022 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_001217506 |
Protein GI | 147673736 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.169797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGTAT CCCTAAACCC TGAGTGGTTA GCTCGTAACA ACGATGAGCA CAAAATTCGC CGCAACGATC ATCGCAGCCC ATTTCAGCGC GATCGCGCGC GTATCCTCCA TTCGGCGGCT TTTCGGCGCT TGCAAGCCAA AACCCAAGTC CACGGAACAA GCTTGAATGA CTTTCATCGC ACTCGCCTCA CCCATTCACT GGAAGCAGCG CAAATCGGTA CCGGCATCGT CGCGCAAATT AAACTCAAAC AACCGGAGTT TCGTGAGCTA TTACCTTCTG ATAGCCTAAT TGATTCACTC TGCCTTGCGC ACGATATTGG TCATCCCCCT TACGGGCATG GCGGTGAAAT TGCGCTCAAT TATATGATGC GCGATCACGG TGGCTTTGAA GGCAATGCGC AGACTTTTCG GATCGTCACC AGCTTAGAGC CTTACACTGA GCATCACGGC ATGAACCTGT CGCGCCGCAC GCTACTCGGG CTTTTGAAAT ACCCTGCGCT GCTGAGTGCC ACGCGCGCTG CAATACCACC GCCAGCGGTC GCCCACCAAC GCCAACTGAA AGCTAAAGAT TGGTCGCCTG CAAAAGGCAT CTACGATTGT GATCTCGCGA GCTTGGACTG GGTGCTGGAG CCGCTGTGTG AAAGTGATCG TGAATTGTTG GGACAAATGC GCGCAGAACC AAGCTCCCCC AAAGAGCACC GTAAAACTCG CTTTAAATCG CTCGATTGCT CGATCATGGA ACTGGCGGAT GACATCGCTT ACGGCGTGCA TGATCTGGAA GATGCGATTG TGCTGGGTAT GGTAACCCGC GCGCAGTGGC AAGAAGCCGC AGCGGCGCAG CTTGCCGAGT GCGGCGATCC TTGGTTTGAA GAACATATTG CCGAGCTCAG TGAGATGCTG TTTTCTGGTA AACACTATGT GTGCAAAGAT GCGATTGGCG GCATTGTAAA TGCCCTTTTA ACCAGTATCA GCGTGAAGCC AGTTGAAGCG CCATTTCATA ATGAACTGTT GGCGTTCAAT GCTTATATCG AGCCGCACAT GGGCAATGCG CTTGAAGTGC TCAAACACTT TGTGAGCCAA TACGTGATTC AAATTCCGCA GGTACAGCGC TTTGAATACA AAGGCCAGCA ACTGATCATG GATTTGTTTG AAGCGTTAAG TGCTGACCCA GAACGTCTAC TGCCACAAGC CACCGGCGAA AAGTGGCGTA AAGCCCAAGA ACAAGACGAA GGCATGCGCG TGATCTGCGA TTACATTGCC GCGATGACCG ATGCTTACGC GCAGCGACTG CATCAGCAGC TCTTCTCAGC GCAGAGTCAT TACTGA
|
Protein sequence | MQVSLNPEWL ARNNDEHKIR RNDHRSPFQR DRARILHSAA FRRLQAKTQV HGTSLNDFHR TRLTHSLEAA QIGTGIVAQI KLKQPEFREL LPSDSLIDSL CLAHDIGHPP YGHGGEIALN YMMRDHGGFE GNAQTFRIVT SLEPYTEHHG MNLSRRTLLG LLKYPALLSA TRAAIPPPAV AHQRQLKAKD WSPAKGIYDC DLASLDWVLE PLCESDRELL GQMRAEPSSP KEHRKTRFKS LDCSIMELAD DIAYGVHDLE DAIVLGMVTR AQWQEAAAAQ LAECGDPWFE EHIAELSEML FSGKHYVCKD AIGGIVNALL TSISVKPVEA PFHNELLAFN AYIEPHMGNA LEVLKHFVSQ YVIQIPQVQR FEYKGQQLIM DLFEALSADP ERLLPQATGE KWRKAQEQDE GMRVICDYIA AMTDAYAQRL HQQLFSAQSH Y
|
| |