Gene Mext_3606 details

Gene Information

Locus tag: Mext_3606
Symbol:
ID: 5831960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3984648
End bp: 3986588
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 73%
IMG OID: 641369399
Product: hypothetical protein
Protein accession: YP_001641055
Protein GI: 163853012
COG category: [S] Function unknown
COG ID: [COG5338] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
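
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which is stated in the record. The helper names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3984648
end_bp = 3986588
gene_length_bp = 1941
protein_length_aa = 646


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, 3986588 - 3984648 + 1 = 1941 bp and 1941 / 3 - 1 = 646 aa, which agrees with the listed lengths.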


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGCTA CGCGGCGGCT GCTCACGGCG GGACAGGGCG AGCGGGCGCC CCGCAACGGC 
ACGTTAACCA TGCCCGGTGT AGCCCCGTTA ACCATGGAGC GCCGGCCGGA CACGGAACGC
CGCCCCGTCC ACGGGCCGCC GGCCGTATCC AGGGAAAGAG CGCTGCCGGT GTCACGCGAG
CGGCCGAACG GGGACAAGCG AGGGGGGCGC CGCGGGGCGT TCGCTCTGGC GCTGCTCCCC
GCCCTGCTCG GGGGCCCCGC GCTGGCGCAG GAGACGCCCG ACCCGTCGCA GGCCGAGCCC
GCCCCGAGCA CGGCGTCGAA CCCGCTCGCG CGCAGCCCCT CCGAGCCGGC GACCGCGGGC
CGCCCGCGCC GCTCCGCCTT CGACGCGCCG ACGGCTTTGC GCGGCTCCGG CGCCTTCTCC
GCCCCCGCCC CGCTCGGCAG CGGCACGACC GCGAATCCGG CCCCCGCCTC CGAGGAGAGC
GAGGAGCCGA GTGTGGCCCG CCTGCCGCGC TTCCGCAGCG CGACGAGCCT GCCCGGCTCC
GCCGCGACGC GGGGCACGCC GGCCCGGCCC TCGATCTTGC GCCTGCGCGC GGCGCCCCCG
CGCCGGCTCG GCACGGCGAC CCGCCAGATC ACCCAGACGC GCACGCAGCA GACGATCACG
GATCTGCGCC TCACCCCGGT GATCCAGACG CCCGTCTCCG GCGTGCCGCT GCCGGCGCCG
ATCCTCGGGC TCGGCCTGCC GAACGCCGCC GGGCTGCTGC TCGGCACGGC GCTCCGCCGG
CCGATCCCCG CCGACACCGC CTACGCGCCG CTCGGCATCC GGCTCGGCAC CTTCACGCTG
CTGCCGGCCT TCACCCAGAG CGTCGGCTAC GATTCGAACC CGGACCAGAT CGGCGGCACC
CGCCTGCGGC CCTCCCTGGC GCTGCGCAGC GAGGCGGAGC TGGCATTGCG CAGCGAGTGG
TCGGCGAGCG AACTCACCGC CGAGATGCGC GGCAGCTACC TCGAATATCC GCAGAACCCC
GAGGCGAGCC GCCCCAATGC GGTGGGCACC GCGCGGATGC GCATCGACGT CGACCGCGAC
ACCCGCATCG ATCTGGAGAC CCGGTTCCTG CTCGACAGCC AGCGCATCGG CAGCCCGGAT
CTCGGCGCGG GCGGGGGGGC GACGACCCGG CCGCTCTTCG CCACCTACGG CGCGACCGCG
GGCGTGCAGG AAAACTTCAA CCGGCTGCAG CTCTCGCTGC GCGGCTCGAT CGACCGCTCG
GTCTTCGAGG ATGCGCAACT CGGCAACGGC ACCACAATCA TCCAGAGCGA CCGCGACGCC
AACCAGTACG GCCTGCGCCT GCGCGCCGGA TACGAGATCT CGCCCGCGAT CACGCCCTTC
GTCGAGACCT TCCTCGACAC CCGGGTCTAC GACACGCCGG TGGACCAGTT CGGCCTGCGC
CGCGATTCCG ACGGCGTCGC CTTCACCGCG GGCGCGGCGG TGGCGCTCAA CAGCACGCTA
ACGGCGGAAA TCTCGGGCGG CCTGCAGCAC CGCTCCTACA TCGATCGCAC CCTGCAGGAC
ATCAACGCGC CGGTCGTCAA CGCGGCACTC ATCTGGTCGG TCTCGCCGCT GACCACGGTG
CGGTTCAACC AGCAGACCGG CGTGATCGAG ACCGCGGTGC CGGGCTCCAG CGGCGCCTTC
ACCGACGCCG CCACGCTCGA AGTGCAGCAC GACCTCTTGC GCAACCTCTC GATCACGCTG
GGCGGCGCTT ACCTCTCCAC CAACTACGAC GGCGTGCGCA TCCGCGAGCG GGGCTACTCC
GCCACCGCCC GGCTCGACTA CCGCTTCAAC CGCTGGCTGG CTCTCCGCGG CAGCTACATC
TACTCGACGC TGAACAGCAC CGTCCCGCTC TCGACCTACG AGGCGCACAC GGTGCTGCTC
GGGGTGCGGG TGAACCCCTG A
 
Protein sequence
MGATRRLLTA GQGERAPRNG TLTMPGVAPL TMERRPDTER RPVHGPPAVS RERALPVSRE 
RPNGDKRGGR RGAFALALLP ALLGGPALAQ ETPDPSQAEP APSTASNPLA RSPSEPATAG
RPRRSAFDAP TALRGSGAFS APAPLGSGTT ANPAPASEES EEPSVARLPR FRSATSLPGS
AATRGTPARP SILRLRAAPP RRLGTATRQI TQTRTQQTIT DLRLTPVIQT PVSGVPLPAP
ILGLGLPNAA GLLLGTALRR PIPADTAYAP LGIRLGTFTL LPAFTQSVGY DSNPDQIGGT
RLRPSLALRS EAELALRSEW SASELTAEMR GSYLEYPQNP EASRPNAVGT ARMRIDVDRD
TRIDLETRFL LDSQRIGSPD LGAGGGATTR PLFATYGATA GVQENFNRLQ LSLRGSIDRS
VFEDAQLGNG TTIIQSDRDA NQYGLRLRAG YEISPAITPF VETFLDTRVY DTPVDQFGLR
RDSDGVAFTA GAAVALNSTL TAEISGGLQH RSYIDRTLQD INAPVVNAAL IWSVSPLTTV
RFNQQTGVIE TAVPGSSGAF TDAATLEVQH DLLRNLSITL GGAYLSTNYD GVRIRERGYS
ATARLDYRFN RWLALRGSYI YSTLNSTVPL STYEAHTVLL GVRVNP
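
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the following code strips that spacing, computes GC content, and translates codons until the first stop. It is not the tool that produced this record. The function names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGGGGCTA CGCGGCGGCT GCTCACGGCG"))  # MGATRRLLTA
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the listed GC content and protein sequence.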