Gene VC0395_A1566 details

Gene Information

Locus tag: VC0395_A1566
Symbol: dgt
ID: 5135809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1684351
End bp: 1685676
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 52%
IMG OID: 640533022
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001217506
Protein GI: 147673736
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
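
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1684351
END_BP = 1685676
REPORTED_GENE_LENGTH = 1326    # bp
REPORTED_PROTEIN_LENGTH = 441  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)
```

Under these assumptions, both computed values agree with the reported gene and protein lengths.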


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.169797
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGTAT CCCTAAACCC TGAGTGGTTA GCTCGTAACA ACGATGAGCA CAAAATTCGC 
CGCAACGATC ATCGCAGCCC ATTTCAGCGC GATCGCGCGC GTATCCTCCA TTCGGCGGCT
TTTCGGCGCT TGCAAGCCAA AACCCAAGTC CACGGAACAA GCTTGAATGA CTTTCATCGC
ACTCGCCTCA CCCATTCACT GGAAGCAGCG CAAATCGGTA CCGGCATCGT CGCGCAAATT
AAACTCAAAC AACCGGAGTT TCGTGAGCTA TTACCTTCTG ATAGCCTAAT TGATTCACTC
TGCCTTGCGC ACGATATTGG TCATCCCCCT TACGGGCATG GCGGTGAAAT TGCGCTCAAT
TATATGATGC GCGATCACGG TGGCTTTGAA GGCAATGCGC AGACTTTTCG GATCGTCACC
AGCTTAGAGC CTTACACTGA GCATCACGGC ATGAACCTGT CGCGCCGCAC GCTACTCGGG
CTTTTGAAAT ACCCTGCGCT GCTGAGTGCC ACGCGCGCTG CAATACCACC GCCAGCGGTC
GCCCACCAAC GCCAACTGAA AGCTAAAGAT TGGTCGCCTG CAAAAGGCAT CTACGATTGT
GATCTCGCGA GCTTGGACTG GGTGCTGGAG CCGCTGTGTG AAAGTGATCG TGAATTGTTG
GGACAAATGC GCGCAGAACC AAGCTCCCCC AAAGAGCACC GTAAAACTCG CTTTAAATCG
CTCGATTGCT CGATCATGGA ACTGGCGGAT GACATCGCTT ACGGCGTGCA TGATCTGGAA
GATGCGATTG TGCTGGGTAT GGTAACCCGC GCGCAGTGGC AAGAAGCCGC AGCGGCGCAG
CTTGCCGAGT GCGGCGATCC TTGGTTTGAA GAACATATTG CCGAGCTCAG TGAGATGCTG
TTTTCTGGTA AACACTATGT GTGCAAAGAT GCGATTGGCG GCATTGTAAA TGCCCTTTTA
ACCAGTATCA GCGTGAAGCC AGTTGAAGCG CCATTTCATA ATGAACTGTT GGCGTTCAAT
GCTTATATCG AGCCGCACAT GGGCAATGCG CTTGAAGTGC TCAAACACTT TGTGAGCCAA
TACGTGATTC AAATTCCGCA GGTACAGCGC TTTGAATACA AAGGCCAGCA ACTGATCATG
GATTTGTTTG AAGCGTTAAG TGCTGACCCA GAACGTCTAC TGCCACAAGC CACCGGCGAA
AAGTGGCGTA AAGCCCAAGA ACAAGACGAA GGCATGCGCG TGATCTGCGA TTACATTGCC
GCGATGACCG ATGCTTACGC GCAGCGACTG CATCAGCAGC TCTTCTCAGC GCAGAGTCAT
TACTGA
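
The gene sequence is printed in blocks of 10 bases, 6 blocks per row. A minimal sketch of joining those blocks into one string and computing a GC fraction follows. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first row of the sequence, so its GC fraction is not expected to match the whole-gene value reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAAGTAT CCCTAAACCC TGAGTGGTTA GCTCGTAACA ACGATGAGCA CAAAATTCGC"
seq = join_blocks(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full sequence text to `join_blocks` in the same way would give the string needed to check the reported length and GC content.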
 
Protein sequence
MQVSLNPEWL ARNNDEHKIR RNDHRSPFQR DRARILHSAA FRRLQAKTQV HGTSLNDFHR 
TRLTHSLEAA QIGTGIVAQI KLKQPEFREL LPSDSLIDSL CLAHDIGHPP YGHGGEIALN
YMMRDHGGFE GNAQTFRIVT SLEPYTEHHG MNLSRRTLLG LLKYPALLSA TRAAIPPPAV
AHQRQLKAKD WSPAKGIYDC DLASLDWVLE PLCESDRELL GQMRAEPSSP KEHRKTRFKS
LDCSIMELAD DIAYGVHDLE DAIVLGMVTR AQWQEAAAAQ LAECGDPWFE EHIAELSEML
FSGKHYVCKD AIGGIVNALL TSISVKPVEA PFHNELLAFN AYIEPHMGNA LEVLKHFVSQ
YVIQIPQVQR FEYKGQQLIM DLFEALSADP ERLLPQATGE KWRKAQEQDE GMRVICDYIA
AMTDAYAQRL HQQLFSAQSH Y
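
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds the standard codon assignments from the NCBI base ordering (T, C, A, G); translation table 11 uses the same amino acid assignments as the standard table and differs in which start codons are permitted, which this sketch does not model. The `translate` function is a hypothetical helper, and only the first and last few codons of the gene are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (first, second, third base over TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0 until a stop codon or the last complete codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


gene_start = "ATGCAAGTAT CCCTAAACCC"  # first 20 bases of the gene
gene_end = "AGTCAT TACTGA"            # last 12 bases, in frame
print(translate(gene_start))
print(translate(gene_end))
```

The two results correspond to the first six residues (MQVSLN) and the last three residues (SHY) of the protein sequence listed above.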