Gene Mext_4216 details


Gene Information

Locus tag: Mext_4216
Symbol:
ID: 5833284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4691401
End bp: 4692858
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 71%
IMG OID: 641370007
Product: pentapeptide repeat-containing protein
Protein accession: YP_001641656
Protein GI: 163853613
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
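
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon, which is not translated; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the untranslated stop codon


# Values taken from the record: Start bp, End bp
length = gene_length_bp(4691401, 4692858)
print(length)                              # 1458
print(expected_protein_length_aa(length))  # 485
```

Both results match the Gene Length and Protein Length fields of the record.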


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0428969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.525681
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAAAACC GCGTCCCTCC CACGGCCCCT CGTCTTGTGT CGCTCGGGCC GGGCCGGCTA 
TGGGGCTTGG CGGCGGGATC ATGCGAGGCG TTTGCGGTGC GACAGGGCGA TGCGGAGGGC
GATGATCCGC GCGTCGCAGC ACTGCTCGCG CGTCTCTCGG ACGACAAGGT GCCGTTCTCG
ATCGCCGGGC GCGACGACTT GGCCGGGCTC ATCCTGTCGC GCGCCGCGCT GAACGGTCAC
ATCGATCCGA CGGCGCCGCC GCGCTGGTGG ACGCAGGGCG CGGGCGGGCT CGATCTGGCC
GGCGCCGACC TCGCCGACGC CCGGCTGGAG ATGACCGATT TTTCCGACGC CAACCTGCGT
CGCGCCTCGC TCGCCGGCGC GCTTGCGCGC TCGGCCGGCT TCGCGAATGC CTGCCTGGAG
GAAGCGGACT TTGCCGGCGC CGACCTCAGC GGCGCGCGCT TTACCGGAAT TGCCGGCGGG
CAGGCCTCCT TCCGCGAGGC GATGCTGGAG GATGCCGACT TCTCCGGCGC CACCATGCGC
TTTGCCCGGC TCGACAAGGC GCTCCTCGAC GGCGCCCGCT TCGAGGGCGC CGATCTGTGG
GGCACCGACT TCACCGGGGC GGATGCCGAC GATTCCGTGT TCCGAAAAGC CCGGCTCGAC
GAGGCCAACC TCTCCGACTG CAATCTCACC GGCGCGGATT TCGAGGGGGC GAGCCTGAAG
AAGGCGCGGC TCGTCGGCTC GCGGCTGCGC GGCGCCAACT TCTCCGGAGC CCACCTCGAC
GGGGCGGACC TGTCGGGGGC CGACTTCTCC CGCACCAGCC TCGTGCGGCT CGACCTCACG
ACATGCAAGC TGCACCGCGC GCGCTTTGCC GGCGCATGGC TGGAAGGCGT GCGGCTTACC
GTCGAGCAGA TCGGCGGGAT GGTCGGCGAA GAGGCGGCGG GCGAATACGA GGCGGCGCAG
GCGAGCTATC TCGCGCTCGA GCGCAACCTT CAGAGCATCG GCAGCCCCGA GGGCGCGAGC
TGGGCCTACA AGCGCGGGCG CCGCATGGGC CGCCGCCATG CCGGCGTGCG GGCCCGCGAG
GCCTTTTTCG CCCGTGATGT GCGGGGAACG CTGAGCTCCG GTTACCGCTG GATCGCCGAC
CGCTTCGTCG AGTGGCTGTG CGACTACGGC GAGAGCCTCT CGCGGATCGC CCGCGCCTTC
CTCGTCGGGA TCTTCCTGTT CGCCGGGGCC TATGGGGCGA CGGGCGGGCT CTTCCACGAG
GGTGAGAACG CGCCGACCTA CAACCCGCTC GATCTCGTGA GCTACAGCGC GCTCAACATG
ATGACCGCCA ACCCACCCGA GATCGGGGTG AAGCCGCTGG GCCGCGTCAC CAACCTGCTG
GTCGGGTTGC AGGGGGCGGC GGGGATCGTG CTGATGGGGT TGTTCGGCTT CGTCCTCGGC
AACCGCCTCC GCCGCTGA
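
The GC content field in the Gene Information section is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; it assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first. The record does not say how its 71% figure was rounded, so the helper (an illustrative name, not part of any database API) returns the unrounded percentage.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 1 G and 3 C out of 10 bases
print(gc_percent("TTGCAAAACC"))  # 40.0
```

Passing the full gene sequence to `gc_percent` gives the value to compare against the reported GC content.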
 
Protein sequence
MQNRVPPTAP RLVSLGPGRL WGLAAGSCEA FAVRQGDAEG DDPRVAALLA RLSDDKVPFS 
IAGRDDLAGL ILSRAALNGH IDPTAPPRWW TQGAGGLDLA GADLADARLE MTDFSDANLR
RASLAGALAR SAGFANACLE EADFAGADLS GARFTGIAGG QASFREAMLE DADFSGATMR
FARLDKALLD GARFEGADLW GTDFTGADAD DSVFRKARLD EANLSDCNLT GADFEGASLK
KARLVGSRLR GANFSGAHLD GADLSGADFS RTSLVRLDLT TCKLHRARFA GAWLEGVRLT
VEQIGGMVGE EAAGEYEAAQ ASYLALERNL QSIGSPEGAS WAYKRGRRMG RRHAGVRARE
AFFARDVRGT LSSGYRWIAD RFVEWLCDYG ESLSRIARAF LVGIFLFAGA YGATGGLFHE
GENAPTYNPL DLVSYSALNM MTANPPEIGV KPLGRVTNLL VGLQGAAGIV LMGLFGFVLG
NRLRR
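
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The gene begins with TTG, which normally encodes leucine but is read as methionine when it serves as the initiation codon. The sketch below builds the codon table from the standard compact amino-acid string (table 11 assigns the same amino acids as the standard code) and assumes a small, non-exhaustive set of start codons. The names `translate_cds` and `START_CODONS` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a common subset of table 11 initiation codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and dropping the final stop."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # the initiation codon is translated as methionine
    return "".join(protein).rstrip("*")


# First three 10-base blocks of the gene sequence
print(translate_cds("TTGCAAAACC GCGTCCCTCC CACGGCCCCT"))  # MQNRVPPTAP
```

The output matches the first ten residues of the protein sequence above; translating the full gene sequence the same way allows a residue-by-residue comparison.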