Gene Mext_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0299 
Symbol 
ID5832738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp334917 
End bp335951 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content72% 
IMG OID641366084 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_001637794 
Protein GI163849751 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.168302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCT CCGACGCCAC CCGCCGCTCT CTCCTGCGCC TCATGGGGTT GGCTGGCGTG 
GCCGGACTCT CCCCTGCCCT GTCGGGTTGC GTGATGGACG GGCTCGCCAC CGGCGCCGTC
GGCGAGACGC AGTCGGCGCT CGACGTGAGC CCGGTGCTCC TCATCGCCAC CACCCGTCGC
CCCGCCGCCG GCAATCCGCC CAAAGCGCCG TTCTTCGGCT CGGAGCGCGG CCGGGGCCTG
AGCTTTGCCG AGGCGCGCAT GACCGCGCCG GACCGCTCGC TGATCGGCAA GGTCTCGGCG
GTGGTGGGCG GCGATTGGGG CGTCCGCTCC GTCGGCGACG TCACGACGGG GTCGGGCGCG
GCGGCGGCCT TCGCCCAATC CGCCTTCGGC CGCGATGTGC TGATCTACGT CCACGGCTAC
CGCGAGAGCT TCGAATCGGC CGCAATCAGC GCCGCCCGCC TCTCCGACGG TATCCGCTTC
AACGGCGCTT CCGCCCTTTT CACATGGCCC TCGGCCGCGG CGACCTTCGA TTACGGCTAC
GACCGCGAGA GCGCACTGTG GTCCCGCGAC GCGTTCGAGG ACCTGTTGAA GACCGTGGCG
ACCACGCCGA GCGGCGGGCG CATCCACATC GTCGCCCACT CGATGGGCAC GCTCCTCACG
TTGGAGACGC TGCGCATGCT GCGGGCCGAG GCCGGCGAGG CGGCGGTCGC CCGGATCGGC
GCCGTGGTGC TTGCCGCGCC CGACATCGAC ATCGACCTGT TCACCAACGG CGTCGAGCGC
CTGGGGCCGG ACGCCAAGCG CATCACCGTC ATCTCGGCGA CGAACGACCG CGCACTCGAA
TTGTCGGGCG CCATTGCCGG CGGCGTCGTC CGCGCGGGCG CCGCCGACCG GGAGCGCCTG
GAGGCTCTGG GCGTGCGCGT GGCCGATGCC TCGGATTACG GCGGCGGCCT CTTCAACCAC
GATCTGTTCC TGTCGAACCG CGAGGTTCAG GCCGTCGTCA AGCGGGCCGT CTCGCGGGGC
AGCAGCGGCA CCTGA
 
Protein sequence
MQPSDATRRS LLRLMGLAGV AGLSPALSGC VMDGLATGAV GETQSALDVS PVLLIATTRR 
PAAGNPPKAP FFGSERGRGL SFAEARMTAP DRSLIGKVSA VVGGDWGVRS VGDVTTGSGA
AAAFAQSAFG RDVLIYVHGY RESFESAAIS AARLSDGIRF NGASALFTWP SAAATFDYGY
DRESALWSRD AFEDLLKTVA TTPSGGRIHI VAHSMGTLLT LETLRMLRAE AGEAAVARIG
AVVLAAPDID IDLFTNGVER LGPDAKRITV ISATNDRALE LSGAIAGGVV RAGAADRERL
EALGVRVADA SDYGGGLFNH DLFLSNREVQ AVVKRAVSRG SSGT