Gene Acid345_4602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4602 
Symbol 
ID4070759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5449731 
End bp5450861 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content59% 
IMG OID637986642 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_593676 
Protein GI94971628 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.161559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.698671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCG GCTATGCGGT GGACGTTGAA CAATCCCGAG GGCGGCGCAT CCCTGAGCCG 
CGGCACGCTT ACCGCAACGA CTTCCAGCGC GACCGCGATC GCGTGCTTCA TGCGCGGGCC
TTTCGTCGCT TAGAGAACAA GACGCAGGTC TTTACCGGCC GCTATTCCGA CCACTTTCGC
AATCGGCTGA CCCATACGAT TGAAGTCCAA CAGATTTCGC GTACGATCGC GAACGCGCTG
GATTTGAACG TTGACCTCGT TGAGGCGTTG GCGCTGGCGC ATGACATTGG GCATCCACCG
TTTGGACATG CCGGTGAGAA GGCGCTCGAT ACCGCGATGC GCAAGCACGG CGAGCGCTTC
GACCACAATC TGCACGCGCT GCGCATCGTG GACGATTTCG AGCTGCGCTA CATCGCGTTC
CGCGGCTTGA ATCTCACCTT CGAAGTGCGC GAGGGGATCA TCAAGCACTC GCGCGATTAC
AAGGAGAGCG AGCATCCGGA ACTGAAGGAG TATCTGCTCG ATCGCCGTCC GCCGCTGGAA
GCGCAGTTGA TCGACCTGAC CGACGAGATC GCCTACAACA CCGCCGACAT GGACGACGGT
TTCGAAGCGC GCATCTTGAA CATCGACGCG CTCCGCACGG TGCCGATCTT CGAGCGCTTC
TATCGCGAGG TGGAGGCGAA GCATCCCACG GCGCGGCGCA AACTGAAGTT CAACGAGACG
GTGAAGCGGA TCTTCGACCG GCTGGTCACC GACCTGATTG AGAACACGCG CAAACGCATC
GCAGACTCCG GCGTGAAGAC AGTTGAGGAT GTTCGCAACT ATCCCGAGCG GCTGGCGGCG
TTCAGTCCGG ATGTGGATGC GGAGCGCGCG GAGTCGAAGG CGTTCCTCTA CAAGAACCTC
TATTTCAGCG AAGCATTGCA AAACGAGAAG ATTGACGCGG AACTGATTGT CGGAGGATTG
TTCGGGCATT TTATGACCCA TCCCGAGAGT TTGCCGCCGG GCTACCAGGA GAAGGCGCAA
CAGGAAACAC GGGCACGCGT GGTGTGCGAC TACATCGCTG GGATGACCGA TAACTTCATC
CAGAGCAACT ACGAGCGGCT GATGACCGAC GAAGCGCCGA GCGAAGAATA G
 
Protein sequence
MPAGYAVDVE QSRGRRIPEP RHAYRNDFQR DRDRVLHARA FRRLENKTQV FTGRYSDHFR 
NRLTHTIEVQ QISRTIANAL DLNVDLVEAL ALAHDIGHPP FGHAGEKALD TAMRKHGERF
DHNLHALRIV DDFELRYIAF RGLNLTFEVR EGIIKHSRDY KESEHPELKE YLLDRRPPLE
AQLIDLTDEI AYNTADMDDG FEARILNIDA LRTVPIFERF YREVEAKHPT ARRKLKFNET
VKRIFDRLVT DLIENTRKRI ADSGVKTVED VRNYPERLAA FSPDVDAERA ESKAFLYKNL
YFSEALQNEK IDAELIVGGL FGHFMTHPES LPPGYQEKAQ QETRARVVCD YIAGMTDNFI
QSNYERLMTD EAPSEE