Gene Information |
Locus tag | Mext_0429 |
Symbol | |
ID | 5835171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 472590 |
End bp | 475940 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641366213 |
Product | tetratricopeptide TPR_4 protein |
Protein accession | YP_001637922 |
Protein GI | 163849879 |
COG category | |
COG ID | |
TIGRFAM ID | |
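
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue, except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Mext_0429).
length_bp = gene_length(472590, 475940)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 3351 1116
```

Under these assumptions, the computed values agree with the Gene Length (3351 bp) and Protein Length (1116 aa) fields.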
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.564621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGGAC GGGCCGGTCA GGCAAAGCGG CGTCCGGGCA GGGCTCTCCT GCGCGCCGGC TTCGCCGTCG GCCTCTGCGC CGCGGCCTCG GCGGCGGAGG CGGCCCGGCT CGTCTCGGCC AAAGGCGCGC AGCCGCCGGA GGGCTTCGGC CGCATCGTCC TGACCTTCGA CGAGCCGGTC TCCGTGAAGG CGCGGCTGTC GGGCGCGATC CTCGTGCTCA ATTTCGGCGA GGCGGTGGGG TCCGGCCCCG AGCGGATCGC GGCGGGGATG CCGGACTACG TCACCGTGGT GCGGCGCGAC CCCGACGGCT CGGCCCTGCG CCTCGCCCTG CAGCGGCCCT ACCGCGTCAA CGTGCAGGAT GCCGGCGAGC AGGTCTTCAT CGATCTCCTG CCGGAAAGCT GGAACGGCTT CCTGCCGCCG CTTCCCACCG AAGTCGTCGC CGACCTCGCC CGCCGGGCTG CCGCCGCCGA GGCCAGCCTG AAGGCGCGCA ACCCGGCTCC TGTGCCGCGT CCGCTCACCC TCGAACTCGC CCGCACCGAT GCCCGCACCC GGCTCTCCCT GCGCCTGCCG GCGGGCTCCG AGGCCGCCTT CGCCCCGGAT GGCACCGGCA CGCGGCTGAC GCTGCCGGGG GCGTGGCGCA TCGACGACCA CGCCCTCCGC GGCCGGCTCG ACCCGAATCT CGGCCGTGTC ACGGTCGAGA CGGATACCGG CGAGGCACGG ATCGTCGCGA GCCCGGCCGA GGGCGTGACA CTCTCCACCT TGCGCGACGA AGATGTCGTG GCGATCGATT TCGTCACCAA GCCGAAGACA CCCGAGACTG CGACGTCATC CGCTTCGCCT GCCGGAGCCT CGGCCGGAGT TGCGAAGGAG GTGCCGAAAG AGGCCGCGCG TCCCGCTGCC TCGAGCGCCG ACGCGTCGCC GCGGCCCGCT GCGCCTCCCG TGCTGTCGCG CAAGGCGGGC TCGGGCCTCG TCTTCCCCTT TGCCAAGCGT GTGCCGGCGG CGTTGTTCGA GCGCGGCGGC ATCGTCACCC TCGTCTTCGC CACGACCGAG CCGGTTGCCG TCCCGCCGCC CGGTGCCACC GGCCTCGTCG CGCTCGCGCC GCCGCTGCGG AGCGGCGGCT TCACCATCCT TCGCTTCACT GCGCCCGCGG GACGGCTCGT CGATCTTCTG CCCGTCACGG AGCCCGCCGG TTGGGAGCTG GCGACCGGCG ACGGCCTTTC CCCGAGCGAG AGCCTGACCG CGCAGCGCGC CCCGACGGCC CAGGGCCGGC TCGGCGTCAG CGTGCGGCTG CCCCAGGCCG GCCCGGCCGG CTGGCTCGAT CTCGACGGTG AGCGCATCGC CGTCGTCACC ACCGACGGCA GCCGCCCTGC CGGCGTGGTC AAGGCGCAGC GCTTCGTCGA GTTCGAGCTG ATTCCGAGCC GGCTCGGCCT CGCCGTTCTC GCCAGTGCCG ACGACCTCAT CGTGCGGCCG GATCTCGACG GCGTGACGAT CGGCCGTGAG AACGCGAGAG AGGGGACAAG GGAAGGCCGC GACGGCGGGC TTTCCGTGTC GGGCATCTCC CGCCCCGCCG ATCCGCCCGT GGGGGCGGTC ACGGAGCTGG CGGTCGATCG CGACGCGTGG GAGAAGGCGC AGCGCGGCGA TGTGCGCGCG ACGCTGCGGG AGGGCCTCGC CGCCGCCGTC GAGGCCCATC GCCGCGACCG CGGCGGCGCC CGCCTCGGTC TCGCCCGGGC GATGATGGCC AACGATCTCG ACGTGGAGGC GCTCGGTGCG CTGACCGCCG CCGCCGCCGA GGATGTGGTC 
ATCGACGGCG ATCGGCAGAC GGCGCTGATG CGCGGCATCC TGCTCGCCCG GATCGGGCGG GCCGAAGAGG CGCGTAAACT CCTCTCCGAC GAGCGGCTCG CCACCAATCC GGAAGCCCGG CTCTGGCGCG GCTATGCCGA CGCCCTGGTC GGCCGGTGGA ACGAGGCGGC CGTCGCGCTG CGTGCCGGCG AATCGGTCTT GGAGCGCTAC CCCGAACCCC TCGCATCGCT GTTCCACGCC GCCGCCGCCG AGGCCGCGGT CGAGACCGGC GACTGGGAAG CGGTCACGCG TGAATCGATC GCCGCCACCC GTGCCGCCAC CGATTACACC CGCGACCGTC TGACCCTGCT GCGGGCGAAG ATGGACGAGG CGACCGGCCG CGGCGCGGCG GCGCTGGCGG CCTACGAGAT TCTGCGGAAT CAAGCGCCTT GGCCGCTGGC GGCGGAGGCG ACCCTGCGGG CGGCCGTCCT CGGCCATACC CTCGGCAAGA CGCCCCTGCC CGAAGCGATC GATCAGTTGG AGGTTCTGGC CCTGACCTGG CACGGCGGCC CGACCGAGAT CGGGACGCTC GGCGCGCTCG GCGGCCTCTA CGAGGAGGCC GGGCGCTGGC GCAAGATCTT CACCACCGCC CGCCGCGCCA ACGCGCTCAG CCCCGAGGCG CCGATCGCCC GCGCGCTGCA CGAGCGGGCC CTGGCCGTGT TCGAGGATCT GTTCCTGGGC GCGCGCGGCG AGCGGCTCGG CGGTGTCGAG GCGCTGGCGC TGTATTTCGA CTTCAAGGAT TTCGCGCCCG CCGGCCGGCG CGCCGACGAG ATCGTACGGC GCCTCGCCGA CCGCCTCGTC GCCCTCGATC TGCTCGAGTC CGCGGACGAG CTGCTGCAGT ACCAGATCGA TCACCGGCTT GAGGGCACGG CCCGCTCCTC GGTCTCGGCG CGGCTCGCCA CGATCCGGCT GATGGAGGGC AAGCCGCTCC AGGCGCTCCA GACGCTCGAC GCGACGCACC TGCCCGAGCT GCCCGAGGAT GTGCGCCGCG CCCGCGCGAT GCTGCGCGCC CGCGCCCTGT CGGATCTCTC CCGCACCGAT CTGGCTCTGG AGACCGTCGA GGGCGAGACC GGCGCCGATG CCGAGCGTCT GCGAGCGGAC ATCCTGTGGG CGGCCCGGCG CTGGCGCGAG GCGGGCGAGG CGCACGAGAT GATCCTCGGG CCGGCATGGC GCTCTGGAAA GCCCCTCGAC GACACGGCGC GAGCCGACGT CATCCGCGCC GGCATCGCCT ACGGGCTCGC GGGCGAATCC CTCGGGCTCG AACGGCTGAA GGCGAAGTTC GCGGGGCCGA TGGCCGAGAG CGCGGATGCC CGCACCTTCG CCATGCTGAC GCGACCCGAT GCGCCCCGCT CCGCCGCCTT CCGCGATGCG GCCCTGCGGG CGACCAAGGC GGAGACGCTT GCCGCCTTCC TCTCGGAGTA CCGCAAGCGC TACCCCGACA GCGCCGTTCC GGAACCCGGC AGTGCCGCGA CGGGCAACCG CGCCGAGGCG CCGTCCCCGC CGCCGGGCTG A
Protein sequence | MGGRAGQAKR RPGRALLRAG FAVGLCAAAS AAEAARLVSA KGAQPPEGFG RIVLTFDEPV SVKARLSGAI LVLNFGEAVG SGPERIAAGM PDYVTVVRRD PDGSALRLAL QRPYRVNVQD AGEQVFIDLL PESWNGFLPP LPTEVVADLA RRAAAAEASL KARNPAPVPR PLTLELARTD ARTRLSLRLP AGSEAAFAPD GTGTRLTLPG AWRIDDHALR GRLDPNLGRV TVETDTGEAR IVASPAEGVT LSTLRDEDVV AIDFVTKPKT PETATSSASP AGASAGVAKE VPKEAARPAA SSADASPRPA APPVLSRKAG SGLVFPFAKR VPAALFERGG IVTLVFATTE PVAVPPPGAT GLVALAPPLR SGGFTILRFT APAGRLVDLL PVTEPAGWEL ATGDGLSPSE SLTAQRAPTA QGRLGVSVRL PQAGPAGWLD LDGERIAVVT TDGSRPAGVV KAQRFVEFEL IPSRLGLAVL ASADDLIVRP DLDGVTIGRE NAREGTREGR DGGLSVSGIS RPADPPVGAV TELAVDRDAW EKAQRGDVRA TLREGLAAAV EAHRRDRGGA RLGLARAMMA NDLDVEALGA LTAAAAEDVV IDGDRQTALM RGILLARIGR AEEARKLLSD ERLATNPEAR LWRGYADALV GRWNEAAVAL RAGESVLERY PEPLASLFHA AAAEAAVETG DWEAVTRESI AATRAATDYT RDRLTLLRAK MDEATGRGAA ALAAYEILRN QAPWPLAAEA TLRAAVLGHT LGKTPLPEAI DQLEVLALTW HGGPTEIGTL GALGGLYEEA GRWRKIFTTA RRANALSPEA PIARALHERA LAVFEDLFLG ARGERLGGVE ALALYFDFKD FAPAGRRADE IVRRLADRLV ALDLLESADE LLQYQIDHRL EGTARSSVSA RLATIRLMEG KPLQALQTLD ATHLPELPED VRRARAMLRA RALSDLSRTD LALETVEGET GADAERLRAD ILWAARRWRE AGEAHEMILG PAWRSGKPLD DTARADVIRA GIAYGLAGES LGLERLKAKF AGPMAESADA RTFAMLTRPD APRSAAFRDA ALRATKAETL AAFLSEYRKR YPDSAVPEPG SAATGNRAEA PSPPPG
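
The sequences in this record are printed in space-separated blocks of 10 characters. Before any downstream use, the whitespace has to be removed. The sketch below is a hedged illustration of that step, together with a GC fraction calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
example = "ATGGGGGGAC GGGCCGGTCA"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Running `gc_fraction` on the full cleaned gene sequence would give a value to compare against the reported GC content. That comparison is not performed here.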