Gene Mpe_A3673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3673 
Symbol 
ID4786081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3883575 
End bp3884930 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content67% 
IMG OID640092256 
Productputative urea/short-chain amide transport system substrate-binding protein 
Protein accessionYP_001022861 
Protein GI124268857 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0513677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAAACA AGCGACCTAT CGCGTTCGAC GCAACCCCCG ACACCCTGCG CCGGAGCCTC 
GTGCTGTCCG CCGCGGCCGC GAGCGGGCTG GGCCTGCTGC CCGGCGCGAT GCGCGACGCG
CGAGCGCAGA GCTTCCCGCC GCTCGGCAAC TTTCCGGCCG GCACCGAGGG CAACTCGGTA
TTCGTGGGCG TGTCGGTCCC GCTGACCGGG GCGTTCTCGG CCGAAGGCAA GGATCAGCAG
CTCGGCTTCG AGCTGGCGTT CGAGCACCTG AACTCGGGCA AGCTGGCCGG CAAGATCCCC
GAGCTCAAGG GCAAGGGTGT CCTGGGCAAG ACCATCACCT TCGGCGTGGT CGATACGGAG
GCCAAGGCCG AATCGGCGAT CCAGGGACAG ACCCGCTTCC TCCGCAACAA CAAGGCGATC
CTGATGACCG GCTGCTACAG CAGCGCGGTG ACCGTCGCGC TCGGCAAGCT GGCGCAGCGC
GAGAAGGTGC TCTACATGGC CGGTCCGGCC GGCTCGGACG ACGTGACCGG CAAGGACTGC
CAGCGCTACA GCTTCCGCTC GCAGCCCAGC ACCACCATGG CCTCGCGCGC GCTGGCCCCG
GTGCTGGCCG AGCGGCTGGG CAAGGGCAAG AAGGTGGCCT ACCTGGTGCC CGACTACACC
TTCGGCCACA CGCAGTTCGA ATCGATGGCG CGGCTGACCG AGCCGATGGG CTGGAAGACG
GTGTCCAAGC AGGTCTGCCC GATCGGCACC GCCGACTTCA GCACCTACCT GCTCAACATC
GCCAACAGCG GCGCCGACGT GTTCGTGAAC TGCACGGTCG GCAACGATTG CTCGGTGTCC
ATCAAGCAGG CCAAGAACTT CGGTGTCATG AAGAATGCCG CCCTGGTGGT GCCGACGCTG
CAGCCCTTCC TTGCGCAGCA GCTGGGCCCC GAGGTGACGC AGGACATCCT GGGCGTGATG
GACTTCTGGT GGTCGCTGGC CGAGAGCAAC GAGCTCGCGA AGCAGTTCGT CGACGACTTC
CAGGCCAAGC ACAACTACAA GCCCTACTGG CCGGCGCACA TCGCCTACTC GCAGATGCTG
ATCTGGGCGG TGGCGGTGGA GCGCGCCAAG ACCTTCTACC CGCCGGAGGT CATCAAGGCC
CTCGAGGCGC GCGTGCCGAT CAGGACCACG CTGGGCGACG TGATGTACCG TCCCGAGGAC
CACCAGCTGG TCCGTCCTGT CCCGGTGATG CGCGGCAAGA AGCCCTCCGA GATGAAGTCC
AAGGACGACT ACTACGAGGT GATCAAGATG ATCCCCGGCG CCGACGCCGT GACGCCGCTC
GACCAGGGCA CCTGCAAGAT GGGCGAACTC ACCTGA
 
Protein sequence
MTNKRPIAFD ATPDTLRRSL VLSAAAASGL GLLPGAMRDA RAQSFPPLGN FPAGTEGNSV 
FVGVSVPLTG AFSAEGKDQQ LGFELAFEHL NSGKLAGKIP ELKGKGVLGK TITFGVVDTE
AKAESAIQGQ TRFLRNNKAI LMTGCYSSAV TVALGKLAQR EKVLYMAGPA GSDDVTGKDC
QRYSFRSQPS TTMASRALAP VLAERLGKGK KVAYLVPDYT FGHTQFESMA RLTEPMGWKT
VSKQVCPIGT ADFSTYLLNI ANSGADVFVN CTVGNDCSVS IKQAKNFGVM KNAALVVPTL
QPFLAQQLGP EVTQDILGVM DFWWSLAESN ELAKQFVDDF QAKHNYKPYW PAHIAYSQML
IWAVAVERAK TFYPPEVIKA LEARVPIRTT LGDVMYRPED HQLVRPVPVM RGKKPSEMKS
KDDYYEVIKM IPGADAVTPL DQGTCKMGEL T