Gene Mpe_A3589 details

Gene Information

Locus tag: Mpe_A3589
Symbol:
ID: 4786174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3799036
End bp: 3800250
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 67%
IMG OID: 640092171
Product: branched-chain amino acid ABC transporter, permease protein
Protein accession: YP_001022777
Protein GI: 124268773
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4177] ABC-type branched-chain amino acid transport system, permease component
TIGRFAM ID: [TIGR03408] urea ABC transporter, permease protein UrtC
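
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3799036
end_bp = 3800250

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1215 404"
```

Under those assumptions the computed values agree with the listed gene length (1215 bp) and protein length (404 aa).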


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGC TGCTCGCGTG GTCTAAGAAC TCGGGGCTCG GCAGCCTGCT GCTGCTGATC 
CTGCTGCTCG CGGTCGTGCT GCCGCTCACG CTGGACATCT TCCGACTCAA CCTGGTCGGC
AAGTACCTCA CCTATGCCTT CGTCGCTGTC GGCCTGGTGA TGGTGTGGGG CTACGGCGGC
GTGCTGAGCC TGGGCCAGGG CGTGTTCTTC GGGCTCGGCG GCTACGCGAT GGCGATGTTC
CTGAAGCTCG AGGCCTCCGA CCCGGAGACC ACCAAGATCC AGACCACGCC GGGCATCCCC
GACTTCATGG ACTGGAACCA GATCACCGAG CTGCCGATGA TGTGGCTGCC GTTCAAGAGC
CTGCCGCTGA GCCTCATTCT GGTGATCGCG GTGCCCACGC TGCTGGCCTG GATCATCAGC
TTCGCGATGT TCAAGCGCCG CGTCGGCGGC GTGTACTTCG CGATCATCAC GCAGGCGGTC
GCGCTGATCC TCACGGTGCT GATCATCGGC CAGCAGGGCT ACACCGGCGG CGTCAACGGC
ATGACCGACC TGAAGACCGT GCTCGGCTGG GACACGCGCA CCGACAGCGC CAAGTACATC
CTCTACTACC TTTGCGTGGC GCTGCTGGTG GCCAGCATCC TGCTGTGCCG CTGGATCCAG
ACCAGCAAGC TCGGCACCCT GCTGCTGGCG ATGCGCGACA AGGAAGACCG GGTGCGCTTC
TCCGGCTACG ACGTCTCCAA CTTCAAGATC TTCACCTTCT GCCTGGCCGC GGCGCTGTCG
GGCATCGGCG GCGCGCTGTT CTCTCTGCAG GTGGGCTTCA TGTCGCCCAG CTTCGTCGGC
ATCGTGCCGT CGATCGAGAT GGTGATCTTC GCGGCGGTCG GCGGGCGCAT GAGCCTGGTC
GGCGCCGTCT ACGGCACGCT GCTGGTCAAC GCCGGCAAGA CCTTCTTCTC GGAGAGCTTC
CCGGACCTGT GGCTGTTCCT GATGGCCGCG CTGTTCATCG GCGTGACGCT GGCCTTCCCG
ATGGGGCTGG CCGGACTGTG GGAGAGCCAC GTGAAGCCGT GGTGGACGAA ACGCCAGGCC
GACCGCCGAT CGACCCGGGA GCGCGTGGCC GCCGCGCACG CCGCCTACCC CGATGCGCCG
CCCAAGCCTG CCCGCGCCAC GCCGCCATCC GGCGATTCGA AGCTGCCCGG CGGCGTGAGC
GGCCAGCGCG CCTGA
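
The gene sequence is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation such as GC content. The sketch below shows one way to do that in Python; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tooling. Only the first line of the sequence is used as sample input, so the printed fraction describes those 60 bases and is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAAATCGC TGCTCGCGTG GTCTAAGAAC TCGGGGCTCG GCAGCCTGCT GCTGCTGATC"
seq = clean_sequence(first_line)

print(len(seq), gc_fraction(seq))
```

Pasting the full sequence into `clean_sequence` in place of `first_line` gives the value for the whole gene.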
 
Protein sequence
MKSLLAWSKN SGLGSLLLLI LLLAVVLPLT LDIFRLNLVG KYLTYAFVAV GLVMVWGYGG 
VLSLGQGVFF GLGGYAMAMF LKLEASDPET TKIQTTPGIP DFMDWNQITE LPMMWLPFKS
LPLSLILVIA VPTLLAWIIS FAMFKRRVGG VYFAIITQAV ALILTVLIIG QQGYTGGVNG
MTDLKTVLGW DTRTDSAKYI LYYLCVALLV ASILLCRWIQ TSKLGTLLLA MRDKEDRVRF
SGYDVSNFKI FTFCLAAALS GIGGALFSLQ VGFMSPSFVG IVPSIEMVIF AAVGGRMSLV
GAVYGTLLVN AGKTFFSESF PDLWLFLMAA LFIGVTLAFP MGLAGLWESH VKPWWTKRQA
DRRSTRERVA AAHAAYPDAP PKPARATPPS GDSKLPGGVS GQRA
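
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following Python sketch illustrates this with a hand-built codon table rather than any particular bioinformatics library. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAATCGC TGCTCGCGTG GTCTAAGAAC TCGGGGCTCG GCAGCCTGCT GCTGCTGATC"
print(translate(first_line))  # prints "MKSLLAWSKNSGLGSLLLLI"
```

The output matches the first 20 residues of the protein sequence listed above.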