Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3673 |
Symbol | |
ID | 4786081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3883575 |
End bp | 3884930 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640092256 |
Product | putative urea/short-chain amide transport system substrate-binding protein |
Protein accession | YP_001022861 |
Protein GI | 124268857 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0513677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAAACA AGCGACCTAT CGCGTTCGAC GCAACCCCCG ACACCCTGCG CCGGAGCCTC GTGCTGTCCG CCGCGGCCGC GAGCGGGCTG GGCCTGCTGC CCGGCGCGAT GCGCGACGCG CGAGCGCAGA GCTTCCCGCC GCTCGGCAAC TTTCCGGCCG GCACCGAGGG CAACTCGGTA TTCGTGGGCG TGTCGGTCCC GCTGACCGGG GCGTTCTCGG CCGAAGGCAA GGATCAGCAG CTCGGCTTCG AGCTGGCGTT CGAGCACCTG AACTCGGGCA AGCTGGCCGG CAAGATCCCC GAGCTCAAGG GCAAGGGTGT CCTGGGCAAG ACCATCACCT TCGGCGTGGT CGATACGGAG GCCAAGGCCG AATCGGCGAT CCAGGGACAG ACCCGCTTCC TCCGCAACAA CAAGGCGATC CTGATGACCG GCTGCTACAG CAGCGCGGTG ACCGTCGCGC TCGGCAAGCT GGCGCAGCGC GAGAAGGTGC TCTACATGGC CGGTCCGGCC GGCTCGGACG ACGTGACCGG CAAGGACTGC CAGCGCTACA GCTTCCGCTC GCAGCCCAGC ACCACCATGG CCTCGCGCGC GCTGGCCCCG GTGCTGGCCG AGCGGCTGGG CAAGGGCAAG AAGGTGGCCT ACCTGGTGCC CGACTACACC TTCGGCCACA CGCAGTTCGA ATCGATGGCG CGGCTGACCG AGCCGATGGG CTGGAAGACG GTGTCCAAGC AGGTCTGCCC GATCGGCACC GCCGACTTCA GCACCTACCT GCTCAACATC GCCAACAGCG GCGCCGACGT GTTCGTGAAC TGCACGGTCG GCAACGATTG CTCGGTGTCC ATCAAGCAGG CCAAGAACTT CGGTGTCATG AAGAATGCCG CCCTGGTGGT GCCGACGCTG CAGCCCTTCC TTGCGCAGCA GCTGGGCCCC GAGGTGACGC AGGACATCCT GGGCGTGATG GACTTCTGGT GGTCGCTGGC CGAGAGCAAC GAGCTCGCGA AGCAGTTCGT CGACGACTTC CAGGCCAAGC ACAACTACAA GCCCTACTGG CCGGCGCACA TCGCCTACTC GCAGATGCTG ATCTGGGCGG TGGCGGTGGA GCGCGCCAAG ACCTTCTACC CGCCGGAGGT CATCAAGGCC CTCGAGGCGC GCGTGCCGAT CAGGACCACG CTGGGCGACG TGATGTACCG TCCCGAGGAC CACCAGCTGG TCCGTCCTGT CCCGGTGATG CGCGGCAAGA AGCCCTCCGA GATGAAGTCC AAGGACGACT ACTACGAGGT GATCAAGATG ATCCCCGGCG CCGACGCCGT GACGCCGCTC GACCAGGGCA CCTGCAAGAT GGGCGAACTC ACCTGA
|
Protein sequence | MTNKRPIAFD ATPDTLRRSL VLSAAAASGL GLLPGAMRDA RAQSFPPLGN FPAGTEGNSV FVGVSVPLTG AFSAEGKDQQ LGFELAFEHL NSGKLAGKIP ELKGKGVLGK TITFGVVDTE AKAESAIQGQ TRFLRNNKAI LMTGCYSSAV TVALGKLAQR EKVLYMAGPA GSDDVTGKDC QRYSFRSQPS TTMASRALAP VLAERLGKGK KVAYLVPDYT FGHTQFESMA RLTEPMGWKT VSKQVCPIGT ADFSTYLLNI ANSGADVFVN CTVGNDCSVS IKQAKNFGVM KNAALVVPTL QPFLAQQLGP EVTQDILGVM DFWWSLAESN ELAKQFVDDF QAKHNYKPYW PAHIAYSQML IWAVAVERAK TFYPPEVIKA LEARVPIRTT LGDVMYRPED HQLVRPVPVM RGKKPSEMKS KDDYYEVIKM IPGADAVTPL DQGTCKMGEL T
|
| |