Gene Mext_0429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0429 
Symbol 
ID5835171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp472590 
End bp475940 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content74% 
IMG OID641366213 
Producttetratricopeptide TPR_4 protein 
Protein accessionYP_001637922 
Protein GI163849879 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.564621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGGAC GGGCCGGTCA GGCAAAGCGG CGTCCGGGCA GGGCTCTCCT GCGCGCCGGC 
TTCGCCGTCG GCCTCTGCGC CGCGGCCTCG GCGGCGGAGG CGGCCCGGCT CGTCTCGGCC
AAAGGCGCGC AGCCGCCGGA GGGCTTCGGC CGCATCGTCC TGACCTTCGA CGAGCCGGTC
TCCGTGAAGG CGCGGCTGTC GGGCGCGATC CTCGTGCTCA ATTTCGGCGA GGCGGTGGGG
TCCGGCCCCG AGCGGATCGC GGCGGGGATG CCGGACTACG TCACCGTGGT GCGGCGCGAC
CCCGACGGCT CGGCCCTGCG CCTCGCCCTG CAGCGGCCCT ACCGCGTCAA CGTGCAGGAT
GCCGGCGAGC AGGTCTTCAT CGATCTCCTG CCGGAAAGCT GGAACGGCTT CCTGCCGCCG
CTTCCCACCG AAGTCGTCGC CGACCTCGCC CGCCGGGCTG CCGCCGCCGA GGCCAGCCTG
AAGGCGCGCA ACCCGGCTCC TGTGCCGCGT CCGCTCACCC TCGAACTCGC CCGCACCGAT
GCCCGCACCC GGCTCTCCCT GCGCCTGCCG GCGGGCTCCG AGGCCGCCTT CGCCCCGGAT
GGCACCGGCA CGCGGCTGAC GCTGCCGGGG GCGTGGCGCA TCGACGACCA CGCCCTCCGC
GGCCGGCTCG ACCCGAATCT CGGCCGTGTC ACGGTCGAGA CGGATACCGG CGAGGCACGG
ATCGTCGCGA GCCCGGCCGA GGGCGTGACA CTCTCCACCT TGCGCGACGA AGATGTCGTG
GCGATCGATT TCGTCACCAA GCCGAAGACA CCCGAGACTG CGACGTCATC CGCTTCGCCT
GCCGGAGCCT CGGCCGGAGT TGCGAAGGAG GTGCCGAAAG AGGCCGCGCG TCCCGCTGCC
TCGAGCGCCG ACGCGTCGCC GCGGCCCGCT GCGCCTCCCG TGCTGTCGCG CAAGGCGGGC
TCGGGCCTCG TCTTCCCCTT TGCCAAGCGT GTGCCGGCGG CGTTGTTCGA GCGCGGCGGC
ATCGTCACCC TCGTCTTCGC CACGACCGAG CCGGTTGCCG TCCCGCCGCC CGGTGCCACC
GGCCTCGTCG CGCTCGCGCC GCCGCTGCGG AGCGGCGGCT TCACCATCCT TCGCTTCACT
GCGCCCGCGG GACGGCTCGT CGATCTTCTG CCCGTCACGG AGCCCGCCGG TTGGGAGCTG
GCGACCGGCG ACGGCCTTTC CCCGAGCGAG AGCCTGACCG CGCAGCGCGC CCCGACGGCC
CAGGGCCGGC TCGGCGTCAG CGTGCGGCTG CCCCAGGCCG GCCCGGCCGG CTGGCTCGAT
CTCGACGGTG AGCGCATCGC CGTCGTCACC ACCGACGGCA GCCGCCCTGC CGGCGTGGTC
AAGGCGCAGC GCTTCGTCGA GTTCGAGCTG ATTCCGAGCC GGCTCGGCCT CGCCGTTCTC
GCCAGTGCCG ACGACCTCAT CGTGCGGCCG GATCTCGACG GCGTGACGAT CGGCCGTGAG
AACGCGAGAG AGGGGACAAG GGAAGGCCGC GACGGCGGGC TTTCCGTGTC GGGCATCTCC
CGCCCCGCCG ATCCGCCCGT GGGGGCGGTC ACGGAGCTGG CGGTCGATCG CGACGCGTGG
GAGAAGGCGC AGCGCGGCGA TGTGCGCGCG ACGCTGCGGG AGGGCCTCGC CGCCGCCGTC
GAGGCCCATC GCCGCGACCG CGGCGGCGCC CGCCTCGGTC TCGCCCGGGC GATGATGGCC
AACGATCTCG ACGTGGAGGC GCTCGGTGCG CTGACCGCCG CCGCCGCCGA GGATGTGGTC
ATCGACGGCG ATCGGCAGAC GGCGCTGATG CGCGGCATCC TGCTCGCCCG GATCGGGCGG
GCCGAAGAGG CGCGTAAACT CCTCTCCGAC GAGCGGCTCG CCACCAATCC GGAAGCCCGG
CTCTGGCGCG GCTATGCCGA CGCCCTGGTC GGCCGGTGGA ACGAGGCGGC CGTCGCGCTG
CGTGCCGGCG AATCGGTCTT GGAGCGCTAC CCCGAACCCC TCGCATCGCT GTTCCACGCC
GCCGCCGCCG AGGCCGCGGT CGAGACCGGC GACTGGGAAG CGGTCACGCG TGAATCGATC
GCCGCCACCC GTGCCGCCAC CGATTACACC CGCGACCGTC TGACCCTGCT GCGGGCGAAG
ATGGACGAGG CGACCGGCCG CGGCGCGGCG GCGCTGGCGG CCTACGAGAT TCTGCGGAAT
CAAGCGCCTT GGCCGCTGGC GGCGGAGGCG ACCCTGCGGG CGGCCGTCCT CGGCCATACC
CTCGGCAAGA CGCCCCTGCC CGAAGCGATC GATCAGTTGG AGGTTCTGGC CCTGACCTGG
CACGGCGGCC CGACCGAGAT CGGGACGCTC GGCGCGCTCG GCGGCCTCTA CGAGGAGGCC
GGGCGCTGGC GCAAGATCTT CACCACCGCC CGCCGCGCCA ACGCGCTCAG CCCCGAGGCG
CCGATCGCCC GCGCGCTGCA CGAGCGGGCC CTGGCCGTGT TCGAGGATCT GTTCCTGGGC
GCGCGCGGCG AGCGGCTCGG CGGTGTCGAG GCGCTGGCGC TGTATTTCGA CTTCAAGGAT
TTCGCGCCCG CCGGCCGGCG CGCCGACGAG ATCGTACGGC GCCTCGCCGA CCGCCTCGTC
GCCCTCGATC TGCTCGAGTC CGCGGACGAG CTGCTGCAGT ACCAGATCGA TCACCGGCTT
GAGGGCACGG CCCGCTCCTC GGTCTCGGCG CGGCTCGCCA CGATCCGGCT GATGGAGGGC
AAGCCGCTCC AGGCGCTCCA GACGCTCGAC GCGACGCACC TGCCCGAGCT GCCCGAGGAT
GTGCGCCGCG CCCGCGCGAT GCTGCGCGCC CGCGCCCTGT CGGATCTCTC CCGCACCGAT
CTGGCTCTGG AGACCGTCGA GGGCGAGACC GGCGCCGATG CCGAGCGTCT GCGAGCGGAC
ATCCTGTGGG CGGCCCGGCG CTGGCGCGAG GCGGGCGAGG CGCACGAGAT GATCCTCGGG
CCGGCATGGC GCTCTGGAAA GCCCCTCGAC GACACGGCGC GAGCCGACGT CATCCGCGCC
GGCATCGCCT ACGGGCTCGC GGGCGAATCC CTCGGGCTCG AACGGCTGAA GGCGAAGTTC
GCGGGGCCGA TGGCCGAGAG CGCGGATGCC CGCACCTTCG CCATGCTGAC GCGACCCGAT
GCGCCCCGCT CCGCCGCCTT CCGCGATGCG GCCCTGCGGG CGACCAAGGC GGAGACGCTT
GCCGCCTTCC TCTCGGAGTA CCGCAAGCGC TACCCCGACA GCGCCGTTCC GGAACCCGGC
AGTGCCGCGA CGGGCAACCG CGCCGAGGCG CCGTCCCCGC CGCCGGGCTG A
 
Protein sequence
MGGRAGQAKR RPGRALLRAG FAVGLCAAAS AAEAARLVSA KGAQPPEGFG RIVLTFDEPV 
SVKARLSGAI LVLNFGEAVG SGPERIAAGM PDYVTVVRRD PDGSALRLAL QRPYRVNVQD
AGEQVFIDLL PESWNGFLPP LPTEVVADLA RRAAAAEASL KARNPAPVPR PLTLELARTD
ARTRLSLRLP AGSEAAFAPD GTGTRLTLPG AWRIDDHALR GRLDPNLGRV TVETDTGEAR
IVASPAEGVT LSTLRDEDVV AIDFVTKPKT PETATSSASP AGASAGVAKE VPKEAARPAA
SSADASPRPA APPVLSRKAG SGLVFPFAKR VPAALFERGG IVTLVFATTE PVAVPPPGAT
GLVALAPPLR SGGFTILRFT APAGRLVDLL PVTEPAGWEL ATGDGLSPSE SLTAQRAPTA
QGRLGVSVRL PQAGPAGWLD LDGERIAVVT TDGSRPAGVV KAQRFVEFEL IPSRLGLAVL
ASADDLIVRP DLDGVTIGRE NAREGTREGR DGGLSVSGIS RPADPPVGAV TELAVDRDAW
EKAQRGDVRA TLREGLAAAV EAHRRDRGGA RLGLARAMMA NDLDVEALGA LTAAAAEDVV
IDGDRQTALM RGILLARIGR AEEARKLLSD ERLATNPEAR LWRGYADALV GRWNEAAVAL
RAGESVLERY PEPLASLFHA AAAEAAVETG DWEAVTRESI AATRAATDYT RDRLTLLRAK
MDEATGRGAA ALAAYEILRN QAPWPLAAEA TLRAAVLGHT LGKTPLPEAI DQLEVLALTW
HGGPTEIGTL GALGGLYEEA GRWRKIFTTA RRANALSPEA PIARALHERA LAVFEDLFLG
ARGERLGGVE ALALYFDFKD FAPAGRRADE IVRRLADRLV ALDLLESADE LLQYQIDHRL
EGTARSSVSA RLATIRLMEG KPLQALQTLD ATHLPELPED VRRARAMLRA RALSDLSRTD
LALETVEGET GADAERLRAD ILWAARRWRE AGEAHEMILG PAWRSGKPLD DTARADVIRA
GIAYGLAGES LGLERLKAKF AGPMAESADA RTFAMLTRPD APRSAAFRDA ALRATKAETL
AAFLSEYRKR YPDSAVPEPG SAATGNRAEA PSPPPG