Gene Msil_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0103 
Symbol 
ID7090420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp94471 
End bp95523 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content62% 
IMG OID643463437 
Productarsenical-resistance protein 
Protein accessionYP_002360447 
Protein GI217976300 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTTT TTGAACGCAC GCTCTCCCTG TGGGTATGCG CCTGCATCGT TGCGGGCGTC 
ATTCTCGGCC AATTGGCGCC GGGGCTATTC CAGGCTATCG GCGCGATCGA AATCGCCAAG
GTCAATCTGC CCGTTGCGGC GCTGATCTGG CTGATGATCG TTCCCATGCT GCTCAGGATC
GATTTTGCGG CGCTCGGCGA AGTGCGGCGT CATTGGCGCG GCATGGGGGT GACGCTGTTT
ATCAATTGGG CGGTGAAGCC CTTTTCGATG GCGGCGCTCG GCTGGCTGTT TATCGGCCAT
TTTTTCCGGC CGTTTCTACC GGCCGATCAA ATCGACTCTT ATATTGCGGG CCTTATTCTT
CTGGCCGCGG CGCCCTGCAC GGCGATGGTT TTCGTCTGGT CCAATCTCGT CAAGGGAGAG
CCGCATTTCA CCCTGAGTCA GGTGGCGCTG AATGACGTCA TCATGGTTGT GGCTTTCGCG
CCCCTGGTCG GGCTTCTGCT CGGCCTGTCG GCGATCGTCG TGCCATGGGA CACGCTCGCT
TTGTCTGTCG GGCTCTACAT CGTGATTCCT GTCATTGCAG CGCAATTGGC CCGGCGCGCG
CTGCTTGCCG GCGGCGCCGA CGCCTTCGCG CGCGTCCTGG CCATTCTACA GCCATTGTCG
CTTGCCGCAT TGCTCGCGAC CTTGGTGCTG TTGTTCGGGT TCCAGGGCGA GCAGATCGCC
GCTCAGCCGC TGGTTATTTT GATGCTCGCC GCGCCGATCC TGATCCAGGT TTATTTCAAC
GCGGGGCTTG CCTATCTGCT CAATCGGATC GTCGGCGAGC CGCATTGCGT CGCCGGGCCC
TCCGCGATGA TCGGCGCCAG CAATTTTTTT GAGCTCGCCG TCGCCGCCGC GATCAGCCTG
TTTGGCTTCC GGTCCGGCGC GGCGCTGGGG ACGGTCGTCG GCGTGCTGAT CGAAGTCCCG
GCGATGCTGT CTCTTGTCTA TATCGTCAAC GCCAGCCGCG GCTGGTATGA GCGCGCGGAG
CCGGCGCCGG CGCCGCGACG GGCGGAGGGA TAA
 
Protein sequence
MSFFERTLSL WVCACIVAGV ILGQLAPGLF QAIGAIEIAK VNLPVAALIW LMIVPMLLRI 
DFAALGEVRR HWRGMGVTLF INWAVKPFSM AALGWLFIGH FFRPFLPADQ IDSYIAGLIL
LAAAPCTAMV FVWSNLVKGE PHFTLSQVAL NDVIMVVAFA PLVGLLLGLS AIVVPWDTLA
LSVGLYIVIP VIAAQLARRA LLAGGADAFA RVLAILQPLS LAALLATLVL LFGFQGEQIA
AQPLVILMLA APILIQVYFN AGLAYLLNRI VGEPHCVAGP SAMIGASNFF ELAVAAAISL
FGFRSGAALG TVVGVLIEVP AMLSLVYIVN ASRGWYERAE PAPAPRRAEG