Gene Msil_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0494 
Symbol 
ID7091227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp548768 
End bp549985 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID643463824 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002360828 
Protein GI217976681 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTCT CACCCTTTCG GCCCGTCCGC GCCGCCTATG CGGCCGACTC CGACGCGAGC 
CGCGGGCGGC TGTTCAGCGA GCCGGGCTCT CCGTCGCGGA CCAAATTCCA ACGCGATCGC
GACCGCATCA TCCATTCGAC AGCTTTCCGG CGGCTGGCGC ATAAAACGCA GGTTTTTATC
CCGCATGAGG GCGATCATTA CCGCACGCGC CTCACCCATT CGCTCGAAGT GGCCCAGATC
GCGCGGGCTC TGGCGCGCGC GCTCGGCCTC GATGACGATC TCGCCGAAGC GGTCGCTCTT
GCGCATGATC TCGGCCATCC GCCGTTCGGC CATGCCGGCG AAGACGCCCT TCATGTGATG
ATGGCGCCCT ATGGCGGCTT CGACCACAAT GCGCAAACGC TGCGGATCGT CACCAAGCTG
GAGCGACGCT ATGCCGCATT CGACGGGCTC AATCTGACCT TCGAAACGCT CGAGGGCCTC
GTCAAACATA ATGGCCCGCT GCGTTTGCCC GATGGGTCGC CAACGCCGCG CTTTGCGAAA
TCGGGCGTTC CCGCCGCCAT TCTCGAATTC GACGCGCTTT TCCATCTCGA TCTCGCCCGC
TTCGCCAGCG CCGAGGCGCA GGCCGCGGCC ATCGCCGACG ACATCGCCTA TAACGCCCAT
GACATCGACG ATGGCCTTCG CGCCGGACTG ATCGACGTCG TCGATATTGC GCAGGCGCCT
TTTCTCGGCG ATCTCGCCGC CGAAGTCGCG GCGCTGCGCC CCGGCCTCGA GCCGGCGCGC
TTCGTTCACG AACTGGTGCG CCGCCTGATC ACGCGTTTCA TCGAGGACGC CATCGGCGAA
AGCAGGCTGC GCCTCGATGC GGCCGGCGCC GGCAGCGCCG AAGATATCCG CGACGGCGCG
GCGCCGATGG TCGCCTTCTC CCCGGCGATG ACGCGGCTCG AGGCGGAGAT CAAGCAGTTT
CTTCTGCATA TCCTCTATCG CCGCGAGCCG CTCAATCGCA TCCGCGCGCA GGCGGCGGAG
GTGATCTTCA ATCTGTTTCC GCATTTTTTC GGAAAGCCCA CGAGCATGCC GGTGGATTGG
GCGGCGGCGG CGCAAGCGGC GGGGGGCGAC GACACGCGCC GCGCGCGGGT CATCTGCGAT
TACATCGCCG GCATGACCGA TCGTTACGTG TTGCAGGAGC ATCGCCGCAT TTTTGGCGCG
GCGCCGGAGC TTCGGTAG
 
Protein sequence
MSVSPFRPVR AAYAADSDAS RGRLFSEPGS PSRTKFQRDR DRIIHSTAFR RLAHKTQVFI 
PHEGDHYRTR LTHSLEVAQI ARALARALGL DDDLAEAVAL AHDLGHPPFG HAGEDALHVM
MAPYGGFDHN AQTLRIVTKL ERRYAAFDGL NLTFETLEGL VKHNGPLRLP DGSPTPRFAK
SGVPAAILEF DALFHLDLAR FASAEAQAAA IADDIAYNAH DIDDGLRAGL IDVVDIAQAP
FLGDLAAEVA ALRPGLEPAR FVHELVRRLI TRFIEDAIGE SRLRLDAAGA GSAEDIRDGA
APMVAFSPAM TRLEAEIKQF LLHILYRREP LNRIRAQAAE VIFNLFPHFF GKPTSMPVDW
AAAAQAAGGD DTRRARVICD YIAGMTDRYV LQEHRRIFGA APELR