Gene Mjls_3454 details


Gene Information

Locus tag: Mjls_3454
Symbol:
ID: 4879166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3627680
End bp: 3628951
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 71%
IMG OID: 640140759
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001071723
Protein GI: 126436032
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
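
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3627680
END_BP = 3628951
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both comparisons are against the values stated in the record.
coords_match_length = span_length(START_BP, END_BP) == GENE_LENGTH_BP
length_matches_protein = residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions the stated coordinates, gene length, and protein length agree with one another.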


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.381116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCCAC GACTGCAGGA CAGCTACGAC GAGTTCGATC GCCAACGCCG GGTGGACGAA 
CCGGCGAAGA GTGCTGTCCT GCCGGGGACC GGCACCGAGC ACCGCACCGA CTTCGCACGC
GACCGCGCCC GGGTGCTGCA CTGTGCGGCG CTGCGCCGCC TCGCCGACAA GACGCAGGTG
GTCGGCCCAC GCGAGGGGGA CACCCCGCGG ACCCGGCTGA CGCATTCGCT CGAGGTCGCC
CAGATCGGCC GCGGGATGGC CGTCGGCCTG GGCTGCGACC CCGACCTCGT CGACCTCGCC
GGCCTCGCCC ACGACATCGG CCACCCGCCC TACGGCCACA ACGGCGAACG CGCACTCAAC
GAGATCGCCA AGGCCTTCGG GGGTTTCGAG GGCAATGCGC AGAACTTCCG CATCCTCACG
CGGCTGGAAC CCAAGGTGCT CGACGCGACC GGTCGCAGCG CCGGGCTCAA CCTGACCAGG
GCGGCGCTGG ATGCGGTGAC GAAATACCCG TGGCAGCGCG GTGACCGCAC GAAGTTCGGC
TTCTACGGCG ACGACATGGC CGCGGCCCGG TGGGTGCGCG ACGGCGCCCC GGCCGAGCGG
CCGTGCCTGG AGGCCCAGGT GATGGACTGG GCCGACGACG TCGCCTACTC GGTGCACGAC
GTCGAGGACG GCGTCGTCTC CGGCCGCATC GACCTGCGGG TGCTGGCCGA CGACGATGCG
GCCGCCTCGC TGGCGCGCCT GGGCGCCGAG GCATTCCCGA CCCTGGCGCC CGACGACCTG
CTGGCCGCGG CCGAACGCCT CTCGCAGATG CCGGTGGTGT CGCAGGTGGG TAAGTACGAC
GGAACCCTGG GCGCATCGGT CGCGCTCAAA CGGATGACCA GCGAACTGGT CGGCCGCTTC
GCCAACGCGG CGATCACCGA GACCAGGTCG GTCGCGGGGG GAGGTGCGCT ACACCGTTTT
GATACCGAGC TGGCGGTGCC GACCCTGGTG CGCGCCGAGG TGGCGGTGCT GAAAATGCTG
GCGCTGCAGT TCATCATGAG CGACCACGGG CACCTGGGGA TCCAGGCCGA CCAGCGCACC
CGCATCCACG AGGTGGCCCT GATCCTGTGG GGGCAGGCGC CGAGCAGCCT GGATCCGCTG
TTCGCGCCCG AGTTCGTCGC CGCCGAGGAC GACGGCGCCC GGCTGCGCGT GGTGATCGAT
CAGATCGCGT CCTACACCGA GGGGCGGTTG GAACGAGTGC ACGAAGCCCG ATCGCCCCGA
CCTCTAGACT GA
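
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "GTGAACCCAC GACTGCAGGA CAGCTACGAC GAGTTCGATC GCCAACGCCG GGTGGACGAA"
demo = clean_sequence(first_line)
demo_gc = gc_fraction(demo)
```

Passing the full gene sequence through the same two functions is one way to compare against the GC content field of the record.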
 
Protein sequence
MNPRLQDSYD EFDRQRRVDE PAKSAVLPGT GTEHRTDFAR DRARVLHCAA LRRLADKTQV 
VGPREGDTPR TRLTHSLEVA QIGRGMAVGL GCDPDLVDLA GLAHDIGHPP YGHNGERALN
EIAKAFGGFE GNAQNFRILT RLEPKVLDAT GRSAGLNLTR AALDAVTKYP WQRGDRTKFG
FYGDDMAAAR WVRDGAPAER PCLEAQVMDW ADDVAYSVHD VEDGVVSGRI DLRVLADDDA
AASLARLGAE AFPTLAPDDL LAAAERLSQM PVVSQVGKYD GTLGASVALK RMTSELVGRF
ANAAITETRS VAGGGALHRF DTELAVPTLV RAEVAVLKML ALQFIMSDHG HLGIQADQRT
RIHEVALILW GQAPSSLDPL FAPEFVAAED DGARLRVVID QIASYTEGRL ERVHEARSPR
PLD
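
The protein sequence corresponds to the gene sequence read under translation table 11 (the bacterial code). The sketch below is a hedged illustration, not the database's own method: it builds the codon table by hand, assumes the input starts at the initiator codon (here GTG, an alternative start codon that is read as methionine), and stops at the first stop codon. `translate_cds` is a hypothetical name.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino-acid
# assignments as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, assuming it begins at the initiator codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    residues = ["M"]  # assumption: the first codon is the start codon, read as Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
head = "GTGAACCCAC GACTGCAGGA CAGCTACGAC GAGTTCGATC GCCAACGCCG GGTGGACGAA"
head_protein = translate_cds(head)
```

For the first 60 bases this yields MNPRLQDSYDEFDRQRRVDE, which matches the first two blocks of the protein sequence above.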