Gene Ksed_11910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_11910 
Symbol 
ID8372699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1217055 
End bp1218383 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID644991469 
Productdeoxyguanosinetriphosphate triphosphohydrolase, putative 
Protein accessionYP_003148995 
Protein GI256825035 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.0192589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.262164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCAG AGCAGTCCTC CCGCTGGGTT CCGCCCCGCA CGGACGCCCC GGCCGGGGTG 
GCGCCGGTCG AGGCCTACTC CGCCCACGAC CGGCAGCGGT GGGAGAGCGA GGACCCCGCC
CGCAAGCGCG CCGACCGCGA CGACTTCGCC CGGGACCGCG GCCGCATCAT CCACGCCGCG
AGCCTGCGGC GCCTGTCCGC GAAGACGCAG GTGATGCAGT CGGGCATGGA CGACGTGGTG
CGCAACCGCC TGACGCACAG CCTGGAGGTC GCCCAGATCG GCCGGGAGAT GGCTGTCTCC
CTGGGGTGCA ACCCCGACGT CGTCGATGCG GCGTGCCTCG CCCACGACCT CGGTCACCCG
CCCTTCGGGC ACAACGGGGA GGACGCGCTC GCGGACGTGG CCCACGAGAT CGGCGGCTTC
GAGGGGAACG CGCAGTCGCT GCGTCTGCTG ACCCGCCTGG AGGCCAAGCG CTTCCGCGCC
GACGGCCGCA GCGTGGGGCT CAACCTCACA CGGGCCACCC TCGATGCGGT GGTCAAGTAC
CCGTGGCTGC GGGGGGAGGG CCCGGCGGGC ACGCCCAAGT ACAACGCCTA CGCCGACGAC
GCCGAGGTGT TCGCGTGGAT CCGCCAGGGC GAACCGGCGG CTGGTCGCCG ATGCCTGGAG
GCCCAGGTGA TGGACCTGGC CGACGACGTG GCCTACAGCG TGCACGACGT CGAGGACGCG
ATTACCTCCG GCCACCTGGA CGTGGCGGCC CTGCGGGACC CGGCGGAGGT CGCCTCGGTG
GCCGCCAACG CCGCTCGCAC GTACGCGGTG GGCCTGCCCG CGTCCCACCT GGTGGGCGCG
ATGGAGCGGC TGACCTCCAG CGGTGTGGTG CCCGTCTCCT ACGACGGCAC CCGTCGCGAC
CTGGCGCGGC TGAAGGACAT GACGAGCACC CTCATCGGGC ACTTCGTGCA GGAGGTCACC
GACGCCACCC GGGTGCAGCA TCCACAGGCG ACCCTGACTC GCTTCGACGC TGACCTGGTC
GTCCCCAGCG AGATCCTGGC CGAGATCGCC ATGCTCAAGG CAGTGGCGTA CACCTACATG
ATGAACACCG AGCACCGGCT GGAGCTGATG GAGCGGCAGC GCACTGCCCT CCAGGAGCTC
GTGGACATGT GGTGGCAGCA CCCTGACCGC ATGGACGCGC AGTACCTGGA GGACCACCGG
GAGGCGGAGC AGCGGGGCGA CGAGGCAGCG GCCCGGCGGG CCGTGGTCGA CCAGGTGGCC
TCGCTCTCGG ACGGGCGGGC CTGGCTGGAG CACCACCGCT GGTGCCGGCA CGGCGGGGGA
CCAGTCTGA
 
Protein sequence
MSSEQSSRWV PPRTDAPAGV APVEAYSAHD RQRWESEDPA RKRADRDDFA RDRGRIIHAA 
SLRRLSAKTQ VMQSGMDDVV RNRLTHSLEV AQIGREMAVS LGCNPDVVDA ACLAHDLGHP
PFGHNGEDAL ADVAHEIGGF EGNAQSLRLL TRLEAKRFRA DGRSVGLNLT RATLDAVVKY
PWLRGEGPAG TPKYNAYADD AEVFAWIRQG EPAAGRRCLE AQVMDLADDV AYSVHDVEDA
ITSGHLDVAA LRDPAEVASV AANAARTYAV GLPASHLVGA MERLTSSGVV PVSYDGTRRD
LARLKDMTST LIGHFVQEVT DATRVQHPQA TLTRFDADLV VPSEILAEIA MLKAVAYTYM
MNTEHRLELM ERQRTALQEL VDMWWQHPDR MDAQYLEDHR EAEQRGDEAA ARRAVVDQVA
SLSDGRAWLE HHRWCRHGGG PV