Gene Phep_2365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2365 
Symbol 
ID8253472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2761750 
End bp2763093 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content42% 
IMG OID644936015 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003092631 
Protein GI255532259 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0404358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTGGG AAAAATTATT ATCGGCTAAA CGCTGGGGAA ATGAAGATAG ATTTATGGGT 
AACCAAAAGG AATCCAGGTC TGAATTCCAG CGCGATTACG ACAGGATCAT CTTTTCTTCT
CCGTTCAGGA GGTTGCAGAA TAAAACCCAG GTATTTCCTT TACCAGGCAG TGTATTTGTG
CACAACAGAT TAACCCACAG CCTGGAAGTA GCCAGTGTTG GCAGGTCGAT GGGTACAATA
TTTTACAATA AGATAAAGGA TCTGGATCCT GGAATAGACG ATTCCTGTCC GCTACTCTGC
GAAATAGGAA ACATCATTGC TTCTGCCTGC CTGGCCCACG ACCTTGGCAA TCCTGCTTTT
GGCCACTCGG GAGAAGCGGC AATTTCAAGT TATTTTACCA GTGGAGCAGG ACAGATTTAT
CAGTCGCAGG TTACAGCAGC ACAATGGGAA GATCTGATCC ATTTTGAAGG AAATGCCAAT
GCTTTGCGTA TTTTAACCCA CCCGTTTACC GGTAAGGGTA CCGGGGGCTT TGCACTTACT
TATGCTACCC TGGCCGCCAT AGCAAAATAT CCCTGTGCCT CATTGGCGGG GCACAATAAA
AAAAACATCT ATACCAAAAA ATATGGCTTT TTCCAGTCTG AAGAAAGCGG CTTTGAGAAA
ATTGCCCTGG AAATGGGCCT CATCAAAGCG CAGGAATCTC CTTTGGTCTA TAAAAGACAT
CCGCTGGTAT ACCTGGTAGA GGCAGCGGAT GACATCTGTT ATAATATCAT TGACCTGGAA
GACGCACATC GGCTCAAAAT CCTTTCCTAT AAAGAAGTAG AGGCCTTGTT GCTGCCACTT
TGCAGGGATG AAAGGATGGA GGGGCGCCTT GCTGAAATAG ATGATGATGA TGCAAAAATC
ACTTTAATGC GGGCCAAGTC TATCAGTACC CTGATCGGTC TGTGTTCGGC AGTGTTTTTT
AAAGAACAGC AGCGCATCCT GGAAGGTAAC TTTAACCAAA GTTTGATGGA CGCTATTGAA
GAACCATTTT TATCGGTGAT GAAAGAAATC GAAAATATTT CTGTTAAAAA GATTTACAAT
TATTCATCTG TAGTACAAAT TGAGGTAGCG GGCTATCAGG TTATGGGTGG TTTGCTGGAA
GAGTTTATTC CGGCTTACCT GCAGGATGAA TCCAAATATC ACAAAAAACT TGTAGCATTA
ATTCCGAGAC AATTTTTAAC CGAAAAAACG GACACTTATT CAAAAATACA GAGCATACTG
GATTTTGTAT CTGGAATGAC AGACATTTAT GCAGTTGAAT TGTTTAGAAA AATCAAGGGA
ATATCATTTC CTTCGATGAG CTAA
 
Protein sequence
MVWEKLLSAK RWGNEDRFMG NQKESRSEFQ RDYDRIIFSS PFRRLQNKTQ VFPLPGSVFV 
HNRLTHSLEV ASVGRSMGTI FYNKIKDLDP GIDDSCPLLC EIGNIIASAC LAHDLGNPAF
GHSGEAAISS YFTSGAGQIY QSQVTAAQWE DLIHFEGNAN ALRILTHPFT GKGTGGFALT
YATLAAIAKY PCASLAGHNK KNIYTKKYGF FQSEESGFEK IALEMGLIKA QESPLVYKRH
PLVYLVEAAD DICYNIIDLE DAHRLKILSY KEVEALLLPL CRDERMEGRL AEIDDDDAKI
TLMRAKSIST LIGLCSAVFF KEQQRILEGN FNQSLMDAIE EPFLSVMKEI ENISVKKIYN
YSSVVQIEVA GYQVMGGLLE EFIPAYLQDE SKYHKKLVAL IPRQFLTEKT DTYSKIQSIL
DFVSGMTDIY AVELFRKIKG ISFPSMS