Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3684 |
Symbol | |
ID | 5832471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4074451 |
End bp | 4078653 |
Gene Length | 4203 bp |
Protein Length | 1400 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641369476 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001641131 |
Protein GI | 163853088 |
COG category | [S] Function unknown |
COG ID | [COG4641] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCAT TTGGGTCCCT CGATTTTGTC GACAAGAGTC ATATTTACGG CTGGATAAAA AAATCGGATT CGGAAGATCC CGTTAATATT GATGTGTTTG CGGACGGCAT CGCGGTCGCT ATAGGTGTTA CTGCAGACAT ATATCGGGAG GATCTGCAAA AAGCTGGGAT CGGACGTGGC TTCCATGGCT TTGTGGTTCC GATTATTGGC GAGGTCAAGC CGGATGGCGT CCTCATATCC GTAAGGCTGT CCGGAACGCA GGCTGAGGTT CTAGAGGAAG TCCTTAGGCA TTCTGCGCCA CAGGAAGTGC ATCAGACATC CAATCGAGAG AGTCTCTCGG AGAGGTCTGA GGACGCTCAA CCGGAAGCAA TTACTCCAAT TGAGACTAAC TGGACGGCCC CTCAAGGTGT AGTGTCCTCA CTCGAAGGCA GTGTGCTGAC CGGTCGATGC CTGGCTACGG CAGACGAACT CGACGGCGAG GGCTTCGCAC TCATCGTCCG AGTCGGCGGG AGGATCGTGA TCGAGGGGGT GAGCCAACCG GCTATTGTGA CTGAAGACGG AAAGCGCTTG GCCGAGTTCG TGCTCGATTT AGTTGAGATG GACACCCTCG CGCGCGAGCA CACCATCGAG ACGCTAAACG ATTGGCTGGC GCTCGGAGCA CCAGCTGAGT CGCCGATCGT CGTGAGCCGA CAAGGAACAC TGGACTTTTT TTGGGCGGAG GCTGAGCGTG TTCCGTCCGC CGAGACTGCA CAGCAGTTAG TCGATATCTT AGTAGGACTC AAAGCGCCTA GCCTTCTAGA CAAGGCTCTA CGGTTTCGCA GCGCCCTCGA GCAATGCGGG CTCTGGATGT CTCCGCCGCT CAGCGACAAG GAGAAGATAG CGGCAGCGGA CGACCCAGCC TTGCTGTGCT TCGACACGAC TTACTTCGCC GAGACCTATC CGTCTCACTT CAGGACAGGT AGCCACGCTC GAGATGGGAT TGAGATATTC AGGGCTCTGT CTGCAGACAG GGCCATGCAG CCGAACCTTC TGTTCGCTGA ACCATGGTAT CGACGCCATC GCAGGGATAT TGCCCAGGCA ATCGCCTGCG GTGAGGCACG GAGCGGGCTT GATCACTTCT TGGAGGTCGG CATGTCTTCC GGACTGTCTC CTTGCGAGTG GATCTCCCTA GCAGTCGATT ATGAGGCTCC CTTCGAACCG GACGCCCGCC ATCCTGTTGT CGAATGGCTG CTTAACCCAG TCGGCATGGT GCCGAAAGAG ATAACGATTC GGTGGCCTGA TGCCACGCCT AGCCCGAGTG CAAAGCTTAC GGAACTGCGC AACATCCTGC AGCGCGCTGT CCACAATCTA TCTTATGACA GCGGCGAAGC CGTTAGAATT CCTCTACTTA GCGACCGCCT ACGCGACAAA AGTCTGGTTA TAATTAATAC CAACACTGCA ACCCGCAACT ATCACATTAC ACGCTCAATT TATCACGACG CACGCGCGCT TCTTGGTGGG GAACGAGTCG CAATAGCGAC TTACGACAAT CTAGTTGAGA CATGCGCCAG TCTCGACAAG CCAATCTTGC TCTGTATAGA TGGTCAGCGG ATCAACTCAG ATATCATTGC AGCGGCGAAT CGCTACTGCA CAGCCTCAGC CCTATGGACG TTCGATGACC CTTACAATTT GAACACTCAT CTTAAGTTTG CAGAACTTTT TGACCTTGTT TGTACAAACG ATCTATCCTG CCTGCCGGTA TACGGTGGTC GCGGGTTTTA CTTGCCACTC GCAGCCCCGA GCACATTGCT AAACGAGGAA ACAGATACCG CTGACCAAGC GTTTGACGTC TTCTTCTGTG GCACAGCTTG GCCGAACCGG GTAGTATTGT TGAATAAGTT GATGCGTGAT CGTCCGCATC TCCGCTACAA GTTTGCTCTG ACATACAATG CTTCAGTTCC GGCTCTTCCG TTGATGAAGC CACCGAGCTC TTACATCCAG TCGCTCTCCT TCTCTGACTT CATCCGCTAC GCAAAGCGGA GCAGAATTAC TTTGTCTCTT CATCGGAACT TTGCTGGAAC AGACGTGTTC AGTACGTCAA GCAATCCCGG CCCACGTGTG TTCGAGGTCG CGGCAGCAGG CGCCTATCAA ATTTCTGAGA ATGGCGGACC GGGATTTGAG GCGTTACTTC CGTCCCACCT CCTGACTTGT TACGATGAAT ATGACGAACT GCTCAATCTA ATTGATAAAG CATTAGCCGA GCCAAAGGAG CGGGCGGCCT CGACTCAGCG TCTTAGGAAC CGCGTTGCCT CAGCTCACAC CTACCGGCAC AGGCTGCTGG TCTTACTGGA AGAACTTGCA CGTTCCGCTC CTCCCGAAAG CGCAGATCCC ATCACGGATA CGCTAGAACG TCCGAAATTG CTATATGTCG TGCACAATAC CGTCAGGCAA CCGCCATTCG GCGGCCTAGA AGTTCATCAG GACATCCTAG CGCAGAATCT CAAAAAGCAC TACGAAATAT ATTTTTTCTA CTCTGCTGAA ACCGCGCCAG GAATGCGAAA GAGTATCTTG GCCGACTGCA ACTACCAGAC CTTGGAGACG AGCATTAATG TTGTAAAAGT CGGGCATGCT GATCTGGAGA ACCCAGAGCT AGAAAAGTTT TTTGGCCACT GCCTCGCTAA GTACAATTTC CAAATGGTTC ACTTTTTTCA TTTCATCAAC CAATCTCCGT CCCTTGCCCA TATTGCGCGC AGCTATGGTA TACCCTATGG TATTTCGTTC CACGACTTTT ATACGGCGTG TAGGCAGTTC AACCTGCTCA ATTTTATTAA TCGCTACTGC AGCAATGAGC GCACAAAGCA ATCGGACTGC GACATTTGCT TGAAAAAAAT ACATAAATTC CCCGAGCACA GTCAGCTGAT CCGCAGAGAC TACTATGGGG ACATTATCGG AAAAGCTGCG ACGCTAATGT TCGTGTCCAG CTCCGCGCGC GACATCCACC AATCGATTTA TCCGCAAACT TCACTCGCAG GGGATAGCCG GGTCCATGGT GCGCCCATCC CTAATTCGAA TTGGGCTTTG ACTCGCCAAA CTGAACACTC GGATAGTCTC ACCGTGCGTC CGACTCGTTT TGTAGTCCTT GGGAACTTCG ATGAACACAA GGGTGCGACC TATTTGCTCG ATGCAATTGG CGCATGCAAA GACATCGACG CCGAATTTCA CTTCCACGGT CACGCGCATG AAAAACTCCG TGAACGCTAT AAAGTCGAGA TTGGCGAGCG TGCTATCTTC CACGGCAGGT ACAAGCCTGG CGAGGTGAAT GTTAGGAACT ATGACTTCTC ATTGCATTTG TCGATTTGGC CGGAGACCTA CTGCCAAACC CTGTCAGAAG CCTGGGCTGC CCGCGTCGTG CCAATTGTCA CCAACATCGG GGCTCTTGGG CAGCGAGTTG AACACGGCGT CAACGGTTAC AAAGTTGACC CAACGCGTCC GGCGACGCTC ACTCGTCTGT TAGAAGCAAT AACGAATGAT AGGCAACGCT ATCTTGCGCT GCAAAAGAAC ATCACCGACG ATCTTTTCCT CGATCAAGCC GAACATATGG CCCTCTATGA CGAGGCCTAT CGACGAGTTC TCGCGCGCTC GAAGCGCACG ACTCCACGTA AAGGTAGTCT GATCCCATTC CGCGGCACAA CAATGGAGAC GCTGCAGCGC AGGCGGCGCA CGCCATTCTG GAATCAAGGA AAAGGCAAGC ATCCGCCCAG CTTAGAAGTC GTGCCTAACG CAGCAGCCCT TTCGCAGATC GGTCAGCTGC GCCCGTTCCA AGGCACGATC GAGATTGTAC AAGGCGTGTT TGATCTGGCC GGGAACCATG ATATGAACCA AGCTGGAAGC GATCCGCTCA TAATAAAGAA TGCTCAGGCG CTGCCGGTGC AAGGATGGGT AGCCCGGCCC GCCTCCGGTA GCCATCTGCC GGCCGCAATC CTGCGGACGC ACAAGGGCGT ATTTGTTCAC ATCCTTAATG TCGCCGAGCG TCCTGACGTG GCCGAGGCGA TCTCAGTGCC TGGTGCCAGA TTGTGGGGAA TTGAGGGACA CATCCGAATG CTGAGCCCGG ACGAAGTAGT TACTGGCCCA GCGGAGCTTA GCCTCGGCTG GCTTGACCTA GATGCATGTG TGGCCCATTC ATGCCAAAAA TTCGTGCCAA TTCTCGGAGT TCTCGGTGGT TGA
|
Protein sequence | MTAFGSLDFV DKSHIYGWIK KSDSEDPVNI DVFADGIAVA IGVTADIYRE DLQKAGIGRG FHGFVVPIIG EVKPDGVLIS VRLSGTQAEV LEEVLRHSAP QEVHQTSNRE SLSERSEDAQ PEAITPIETN WTAPQGVVSS LEGSVLTGRC LATADELDGE GFALIVRVGG RIVIEGVSQP AIVTEDGKRL AEFVLDLVEM DTLAREHTIE TLNDWLALGA PAESPIVVSR QGTLDFFWAE AERVPSAETA QQLVDILVGL KAPSLLDKAL RFRSALEQCG LWMSPPLSDK EKIAAADDPA LLCFDTTYFA ETYPSHFRTG SHARDGIEIF RALSADRAMQ PNLLFAEPWY RRHRRDIAQA IACGEARSGL DHFLEVGMSS GLSPCEWISL AVDYEAPFEP DARHPVVEWL LNPVGMVPKE ITIRWPDATP SPSAKLTELR NILQRAVHNL SYDSGEAVRI PLLSDRLRDK SLVIINTNTA TRNYHITRSI YHDARALLGG ERVAIATYDN LVETCASLDK PILLCIDGQR INSDIIAAAN RYCTASALWT FDDPYNLNTH LKFAELFDLV CTNDLSCLPV YGGRGFYLPL AAPSTLLNEE TDTADQAFDV FFCGTAWPNR VVLLNKLMRD RPHLRYKFAL TYNASVPALP LMKPPSSYIQ SLSFSDFIRY AKRSRITLSL HRNFAGTDVF STSSNPGPRV FEVAAAGAYQ ISENGGPGFE ALLPSHLLTC YDEYDELLNL IDKALAEPKE RAASTQRLRN RVASAHTYRH RLLVLLEELA RSAPPESADP ITDTLERPKL LYVVHNTVRQ PPFGGLEVHQ DILAQNLKKH YEIYFFYSAE TAPGMRKSIL ADCNYQTLET SINVVKVGHA DLENPELEKF FGHCLAKYNF QMVHFFHFIN QSPSLAHIAR SYGIPYGISF HDFYTACRQF NLLNFINRYC SNERTKQSDC DICLKKIHKF PEHSQLIRRD YYGDIIGKAA TLMFVSSSAR DIHQSIYPQT SLAGDSRVHG APIPNSNWAL TRQTEHSDSL TVRPTRFVVL GNFDEHKGAT YLLDAIGACK DIDAEFHFHG HAHEKLRERY KVEIGERAIF HGRYKPGEVN VRNYDFSLHL SIWPETYCQT LSEAWAARVV PIVTNIGALG QRVEHGVNGY KVDPTRPATL TRLLEAITND RQRYLALQKN ITDDLFLDQA EHMALYDEAY RRVLARSKRT TPRKGSLIPF RGTTMETLQR RRRTPFWNQG KGKHPPSLEV VPNAAALSQI GQLRPFQGTI EIVQGVFDLA GNHDMNQAGS DPLIIKNAQA LPVQGWVARP ASGSHLPAAI LRTHKGVFVH ILNVAERPDV AEAISVPGAR LWGIEGHIRM LSPDEVVTGP AELSLGWLDL DACVAHSCQK FVPILGVLGG
|
| |