Gene Mext_4733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4733 
Symbol 
ID5832001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5285921 
End bp5287309 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content67% 
IMG OID641370530 
Producthypothetical protein 
Protein accessionYP_001642172 
Protein GI163854129 
COG category[S] Function unknown 
COG ID[COG3034] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTGC GTCCGTTTGT GGCGGCTGCT GCCGTCGCTC TGGCGATGAC CCTCGGTGCC 
TGCCAGGACA GCGCCATGCT CGGCGGCGCT TCGACCCGCA GCCTGACGCC GATTCCCCCG
CAGACGCTCG CGCTCATGCA GACCAAGGGC ATGAGCCAAT CCGACCCGAT CCTGATCCGC
GCCTTCAAGA AGGAAGCGGA GATGGAGGTG TGGAAGCGCG GCGCCAACGG CCAGTACGCG
CTGCTGAAGA CCTTCCCGAT CTGCCGCTGG TCGGGTCAGC TCGGCCCCAA GCTGAAGCAG
GGCGACCGGC AGGCGCCGGA AGGGTTCTAC GCGATCACGC CGGGTCAGAT GAACCCGAAT
TCCAGCTACT ACCTCTCCTT CGATGTCGGC TACCCCAACG CCATCGACCG GGCCAAGGGC
GGCACCGGCA ATTACATCAT GGTGCACGGC ACCTGCTCGT CGTCGGGCTG TTTCGCGATG
ACCGACGCCT CGATGTCGGA GATCTACGCC ATCGCCCGCG AGGCCTTCAA CGGCGGCCAG
CGCGCCTTCC AGTTCCAGTC CTACCCGTTC CGGATGACGG CCTCGAACAT CGCCAAGTTC
CGCAACGACC CCAACGCGCC GTTCTGGAAA AACCTCAAGG AAGGCTCGGA CTATTTCGAG
ACCCTCAAGG AGGAGCCGCG GGTCGCGGCC TGCGGCACCA AATACGTGTT CGGCGGTGCC
GATGTGGCGG CGGGCAACTG CACGCCGCGG GTCGATCCGC TGGTCGCCGA GAAGCGCGAC
CGCGACAGCC ACGAGGTTGC CGAGCTGATC GCCAAGGGCA CGCCTGCGAC CCGCGTCGTC
TATGACGACG GCGGCCAGAA CCCGGTCTTC CGCCCGCAGA AGCCGGAGGG GCCGACCTTC
GCAGCCCTGA TCGAGAACGA GAAGGAAAAG GAGAAGGAGC TCGCCTATAC GGCCAAGGAA
TACGGCCGCT ATCACCTCGG TGACGTCAGC CGGCCGGAGA GCCTCGCGCT CGGGCCGAAC
GAGTTCGAGG TCGATGCCAA GGGCAAGCCC GTCCTGATCG CCGCCGCTCC CGCCGACGCG
CCCGGCCCGA CCAAGGCCGC CGCCGCCCGT AAGGAGCAGG TGAAGCCGAC GACGGTGGTC
GCCGTGCAGG AGCCGGCCAA GGCGGCTGAC AGCAAGGCGG GTCACACCAA GCCCGGCTCA
GCCAAGCCGG CCCGTGTCAC CGTGGCCGAC GCGGACGGCG ATGTGAGCGC CTTCAGCAAG
GTCCTGGGCA AGAAGCCCGG CAAGGACGAG AAGGCCACCG AGGCCCACAA GCCTGAGACT
CACAAGTCCG AGGCCCCCAA GGCGAAGCCG CAGGCCGCGG TCACGACCAC CGGTTCCCTG
CGGAACTGA
 
Protein sequence
MAVRPFVAAA AVALAMTLGA CQDSAMLGGA STRSLTPIPP QTLALMQTKG MSQSDPILIR 
AFKKEAEMEV WKRGANGQYA LLKTFPICRW SGQLGPKLKQ GDRQAPEGFY AITPGQMNPN
SSYYLSFDVG YPNAIDRAKG GTGNYIMVHG TCSSSGCFAM TDASMSEIYA IAREAFNGGQ
RAFQFQSYPF RMTASNIAKF RNDPNAPFWK NLKEGSDYFE TLKEEPRVAA CGTKYVFGGA
DVAAGNCTPR VDPLVAEKRD RDSHEVAELI AKGTPATRVV YDDGGQNPVF RPQKPEGPTF
AALIENEKEK EKELAYTAKE YGRYHLGDVS RPESLALGPN EFEVDAKGKP VLIAAAPADA
PGPTKAAAAR KEQVKPTTVV AVQEPAKAAD SKAGHTKPGS AKPARVTVAD ADGDVSAFSK
VLGKKPGKDE KATEAHKPET HKSEAPKAKP QAAVTTTGSL RN