Gene Msil_1017 details

Gene Information

Locus tag: Msil_1017
Symbol:
ID: 7091845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1103844
End bp: 1105004
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 64%
IMG OID: 643464356
Product: 2'-deoxycytidine 5'-triphosphate deaminase
Protein accession: YP_002361348
Protein GI: 217977201
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0717] Deoxycytidine deaminase
TIGRFAM ID:
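
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming (the record does not state this) that coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1103844, 1105004)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```

Both values agree with the Gene Length and Protein Length fields of this record.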


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.00286027
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAATCGC TTTTCCCGAA ACTGGATCAG GATGATTTCG GCCCGCATTT CGGACTGCTG 
TCCCGCCAGA AGATCGAGCT GATGGCGACG CGCCGCATGA TCCAGGCGGC GGACGCCCTT
GACGAGCGAC AGCTTCAGCC GGCGAGCCTT GACCTGCGCC TCGGCGCGCG GGCCTATCGC
GTGCGGGCCA GCTTCTTGCC GGGGCGCGAA CGCACGGTGA TGGAGCAGCT GCGCGCCTTC
GCCCGGGACG AGGATGCAAT CAGCCTCGAA CAGGGCGCCG TGCTGGAGCG CGGCTGCGTC
TATGTCATCC CGCTGATCGA GCATCTGCGC CTGCCCGACA GCATCGCGGC TTTCGCTAAT
CCGAAAAGCT CGACCGGCCG GCTCGACATT TTCACGCGGC TCATCACCGA TAATTCGGAG
GTGTTCGACC GCGTCGCCCG CGCCTATGAG GGGCCGCTCT ACGCCGAGGT GTCGCCGCGC
AGCTTTTCGG TGCGCGTTCG CAAAGGATCG AAACTGAACC AGATCCGCTT CCGGCGGCTG
AATTCGCAGC AGCTCGAACG CACCGGATTT GCGGTCGACG ATCGCGATCT ACGCGAACGA
CACAAGGCGG CGTCCCTCGT CGACGGCGAG CTCAATTTGC GTCAGGGACT TGTCGTGCGG
GTCGCGCTGA GCGCGGCGAT CCAGCCGGAC GGCGCCATCG GGTACCGCGC GCAAAAACAC
GCCGACATCA TCGACGTCGA CCGCGCCGGC GGCTACCGGC TCGACGATTA TTGGGACAGG
ATTTTCGCGC GGCCGGACGG GCGGCTCATT CTCGATCCCG GCGAGTTCTA CATCCTCGCC
TCGCAGGAGC GCCTGCACAT TCCAAGCGAT CTCGCCGCCG AAATGGTGCC GATCGATCCG
GCCATGGGCG AATTTCGCGT TCATTATGCG GGCTTTTTCG ATCCAGGCTT TGGCGCGTCC
CCCGATAATC GTCCCGGCGC TCGCGCCGTG CTCGAGGTGC GCAGCCACGA GGTGCCCTTC
GTGCTGGAGG ACGGCCAGAT CATCGGCCGG CTGGTCTATG AGAAAATGGC GGAGGCGCCG
CATGCGCTTT ACGGCGAGGG AGAGGGCTCC AATTATCAGG GCCAGGGACT AAAGCTGTCG
AAGCATTTTG TGATGGATTA G
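
The gene sequence is laid out in blocks of 10 bases separated by spaces and line breaks. Below is a minimal sketch of how such a listing could be normalised and its GC content computed. It assumes GC content means the percentage of G and C bases over the full sequence length; how the database rounds the reported value is not stated in the record. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the final line of the listing above (21 bases):
tail = clean_sequence("AAGCATTTTG TGATGGATTA G")
print(len(tail))
print(round(gc_percent(tail), 1))
```

Applying the same two helpers to the complete listing gives the cleaned sequence, whose length can be compared with the Gene Length field.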
 
Protein sequence
MQSLFPKLDQ DDFGPHFGLL SRQKIELMAT RRMIQAADAL DERQLQPASL DLRLGARAYR 
VRASFLPGRE RTVMEQLRAF ARDEDAISLE QGAVLERGCV YVIPLIEHLR LPDSIAAFAN
PKSSTGRLDI FTRLITDNSE VFDRVARAYE GPLYAEVSPR SFSVRVRKGS KLNQIRFRRL
NSQQLERTGF AVDDRDLRER HKAASLVDGE LNLRQGLVVR VALSAAIQPD GAIGYRAQKH
ADIIDVDRAG GYRLDDYWDR IFARPDGRLI LDPGEFYILA SQERLHIPSD LAAEMVPIDP
AMGEFRVHYA GFFDPGFGAS PDNRPGARAV LEVRSHEVPF VLEDGQIIGR LVYEKMAEAP
HALYGEGEGS NYQGQGLKLS KHFVMD
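
The protein sequence corresponds to a translation of the gene sequence under translation table 11. Here is a minimal sketch of that translation, assuming the sequence is given in the coding orientation. Table 11 assigns the same amino acids to codons as the standard genetic code and differs in the permitted start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` function is illustrative and not part of the database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAATCGC TTTTCCCGAA ACTGGATCAG"))  # MQSLFPKLDQ
```

The output matches the first ten residues of the protein sequence listed above.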