Gene Mext_1995 details


Gene Information

Locus tag: Mext_1995
Symbol:
ID: 5831556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2224832
End bp: 2225998
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 71%
IMG OID: 641367796
Product: regulatory protein LuxR
Protein accession: YP_001639465
Protein GI: 163851422
COG category: [K] Transcription
COG ID: [COG2771] DNA-binding HTH domain-containing proteins
TIGRFAM ID:
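
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both are consistent with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length and is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2224832, 2225998)
print(length, protein_length_aa(length))  # prints: 1167 388
```

Both results agree with the Gene Length and Protein Length fields of the record.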


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCC ATTCCCACCC CTTTCCCTCC GCCATTCGCT CGGCCTCGAT CGAGCCGGCC 
GCCGCCCTGT CGGCCGAGGG CGCCGCCCTG ATCGACCGGA TCTACGAGGC GGCGGCCTTG
CCCGAGCTGT GGCGCGATGT GCTCGTCGAA CTCGCCCGCT TCGCCGGCGC CCCGCAGGCG
GTGATGATCG TCTCGACCGG GACGCACTTT CGCGACTTCG TGACGACGTC GCCGGAGTTC
GACCCGTTGG TGATCGATCA TTTCGAGCGC TTCCCCGACA ACGTCCGCAT CGGACGCCTG
TTGGCGCTGC GTCATCCCGG CTTTCTCAAC GACCTCGACG TCGTGACGGA GGAGGAGATC
GCGACGCTGC CGCTCTATCA GGACTTCCTG ATCCCCCGGG GCTACGGCGC GGGTACCGCG
ACGGCCGTGC TGGTGCCGAG CGGCGACAGC GTCATCGTCC ATTGCGAGCG CGCCCGCGCC
GAGGGCGATT TCGGACCGCA GATTCTGTCC GCACTCAACA GCCTGCGTCC CCATCTCGCG
CGGGCCGCCC TGCTTTCCGC ACGCCTGGAG ATGGAGCGGG TCTCGACCAC CACCCGGACG
CTCGAAGCGC TCGGCCTGCC GGCGGCCGTG CTCGGAAGCG GCGGGCGGGT CATCGACGCC
AATCCGTCCC TGGTGGCGAT GATGCCTCAC ACCCTCAGCG ACCAGCCCTT GCGGCTCGCC
GTCGTCGATC CGGCCGCCGA CAGGCTGCTG CGCGAAGCCG TGGCACAATC CGCCTCGACG
CAGGCGATGC CGGTGCGCTC GATCCCGATC GCCGCGAGCG GTGAGCGTCC CCCGGTGATC
CTGCATCTCG TGCCGATCCG CGGCGCGGCC CACGACGTGT TCGTCCGCGC CCGCTTCGTG
CTGATCGCGA CCCCCGTCGT GGCCCAGGAC GTGCCGAGTG CGGATGTGGT CCAGGGCCTG
TTCGACCTGA CGCCGGCCGA GGCCCGGCTC GCCGCCCTGA TCGCGGCGGG CGATGCCCCG
GCACCGGCCG CGGCCAAGCT CGGGATCACC CCCAGCACCG CCCGCTCGGT GCTCCGGCGC
ATCTTCCAGA AGACCGGCGT GTCGCGCCAA GCCGAGCTCG TCGGCCTGCT CGCCGGCCGG
GGCGCCGGGT CGGGATTGCG CGAATAG
 
Protein sequence
MTIHSHPFPS AIRSASIEPA AALSAEGAAL IDRIYEAAAL PELWRDVLVE LARFAGAPQA 
VMIVSTGTHF RDFVTTSPEF DPLVIDHFER FPDNVRIGRL LALRHPGFLN DLDVVTEEEI
ATLPLYQDFL IPRGYGAGTA TAVLVPSGDS VIVHCERARA EGDFGPQILS ALNSLRPHLA
RAALLSARLE MERVSTTTRT LEALGLPAAV LGSGGRVIDA NPSLVAMMPH TLSDQPLRLA
VVDPAADRLL REAVAQSAST QAMPVRSIPI AASGERPPVI LHLVPIRGAA HDVFVRARFV
LIATPVVAQD VPSADVVQGL FDLTPAEARL AALIAAGDAP APAAAKLGIT PSTARSVLRR
IFQKTGVSRQ AELVGLLAGR GAGSGLRE
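
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the helpers below strip the block spacing used in this record, translate a coding sequence, and compute its GC fraction. The helper names are hypothetical. The codon table is built by hand rather than taken from a library; for sense and stop codons, table 11 assigns the same amino acids as the standard code and differs only in which start codons it permits, which this sketch does not model.

```python
from itertools import product

# Codon table in TCAG order. Table 11 (bacterial, archaeal and plant
# plastid) uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.
    Any trailing partial codon is ignored."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
head = clean("ATGACGATCC ATTCCCACCC CTTTCCCTCC")
print(translate(head))  # prints: MTIHSHPFPS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `clean`, then to `translate` and `gc_fraction`, would allow the protein sequence and the GC content field to be checked the same way.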