Gene Mpe_A0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0122 
SymbolcysP 
ID4784524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp124475 
End bp125488 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content65% 
IMG OID640088669 
Productthiosulfate binding protein 
Protein accessionYP_001019319 
Protein GI124265315 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.964738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000712106 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCCGTT TCGCTTCCGT CCGTGGCGCG CTCGCCGCCT CGATCCTTGT TTTGGCCGCC 
GGTGCCGCCG CCGCCAAGGA CGTGACCCTG CTCAATGTCT CCTACGACCC GACGCGCGAG
CTCTACGTCG AGTACAACGC CGCCTTCGCC AAGTACTGGA AGGGCAAGAC CGGCGACAAC
GTGACCGTGA AGCAGTCGCA CGGCGGCTCG GGCAAGCAGG CTCGCTCGGT GATCGACGGC
ATCGACGCCG ACGTGGTGAC GCTGGCGCTG GCCTACGACA TCGACGAGAT CGGCGAGAAG
GCCAAGCTGC TGCCGGCCGA CTGGCAGAAG CGCCTGAAGC ACAACAGCTC GCCCTACACC
TCCACCTACA TCTTCCTGGT GCGCAAGGGC AACCCGAAGG GCATCAAGAA CTGGGACGAC
CTGGTGAAGC CGGGCGTGTC GGTGATCACC GCGAACCCCA AGACCTCGGG TGGCGCCCGC
TGGGGCTACC TGGCGGCCTA CGGCTTCGCG CTCAAGCAGC CCGGAGGCGA CGATGCCAAG
GCGCGCGAGT TCGTCGGCAA CCTGTTCAAG AACGTGCCGG TGCTGGATTC CGGCGCCCGC
GGCTCCACGG TGACCTTCGC CGAGCGTGGT ATCGGTGACG TGCTGCTGGC CTGGGAGAAC
GAGGCTCACC TTTCGCTGAA GGAGTTCGGC GTCGACAAGT TCGACATCGT CTACCCGCCG
CTGAGCATCC TGGCCGAGCC GCCGGTGACG GTGGTCGACA AGGTGGTCGA CAAGAAGGGC
ACCCGCGACG TGGCCCAGGC CTACCTCGAG TACCTGTACA CCGCGGAGGG CCAGGAAATC
GCCGCGCGCA ACTTCTACCG GCCGATCGAC GAGAAGGTCG CGGCGAAGTA CGCGAAGAAC
TTCCCGAAGG TCAACCTGTT CACCATCGAC GAGGTGTTCG GCGGGTGGGC CAAGGCGCAG
AAGACCCACT TCGCCGACGG CGGTGTGTTC GATCAGATCT ATACGAAGAA GTAA
 
Protein sequence
MTRFASVRGA LAASILVLAA GAAAAKDVTL LNVSYDPTRE LYVEYNAAFA KYWKGKTGDN 
VTVKQSHGGS GKQARSVIDG IDADVVTLAL AYDIDEIGEK AKLLPADWQK RLKHNSSPYT
STYIFLVRKG NPKGIKNWDD LVKPGVSVIT ANPKTSGGAR WGYLAAYGFA LKQPGGDDAK
AREFVGNLFK NVPVLDSGAR GSTVTFAERG IGDVLLAWEN EAHLSLKEFG VDKFDIVYPP
LSILAEPPVT VVDKVVDKKG TRDVAQAYLE YLYTAEGQEI AARNFYRPID EKVAAKYAKN
FPKVNLFTID EVFGGWAKAQ KTHFADGGVF DQIYTKK