Gene Mext_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2100 
Symbol 
ID5833207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2353112 
End bp2354374 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content65% 
IMG OID641367897 
Producthypothetical protein 
Protein accessionYP_001639566 
Protein GI163851523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.281286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00458205 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAATCA AAACAAAGCT TCTGGCCGCG ACCGCGGTGC TTTCCACCCT GAGCATCAAC 
GCGTTGCCAG TCCTCGCCGC AGATATGCCT GCCGCCAAGT CGGCGCCCGT CATCGTCGAG
GAGCATTGTA AGGCTGCGAT CTCCACCCCG ACCTTCGGCG GTCTCATCAA GGCGAACCCG
AACCCGGCCT GCATCGTGAC GGGACTGGGC GACATCTATG TCGGCGGCGC GGTCACCGGC
TTCGCCTACA CCCAGACCAA CGCCTTCGGC ATCCTCTCGC CCAGCGCTGA GCAGGACCGC
TTCGGCCGCG TCGACTTCTC GAACCTCCAG GGCTGGATCC AGAAGGCCGA CGGCCCGCTG
CAATTCTACG TCCATGCCGG CCTGTACTCG ATCCCGGCGC TCGGCCTGCC GCTCTACTCC
GCGTTCGAGC AGACCGAATC GCTGTTCGGC CCGATCCCGG TGGCCTTCGG CAAGTGGCAG
ATCAACGACG AGTGGTCGAT CCAGGCCGGT CGGATGTTCA CCAACATCGG CTCCGAGCTG
CTGTTCACCT ACCAGAACCT GAACATCTCC CGCGGTCTGC TGTTCAACCA GGAGAACTTC
ATCAACCACG GCGTCCAGGT GAACTACGCC AACGGCCCGT TCGCGGCCGC TCTCGCGGTG
ACCGACGGCT TCTACTCGGG TGAGCTGAAC TGGGTGACGG GCTTTGCCAC CTACAAGCTC
AACGACGCGA ACACGATCGG CATCAACGGC GGCACGCATT TCAGCGATTT CGACGCCTCG
ACCCGCAGCC CGCGCTTCCA GTTCGCGACG ATCAACTCGC TGCAGAACAG CAGCATCATC
AGCGTGAACT ACACCTACGC CAACGGGCCG TGGATCATCT CGCCGTACTT CCAGTACACA
AACGTCGCGC GCAAGGAGGG GTACTTCTCC CCGATCGAGG GCGCGGAGAC CTGGGGCGGC
ACGCTGCTGG CCGGCTACAC CTTCACCGAC AACTTCGCGC TCGCCGGCCG CCTCGAATAC
ATCGAGCAGT CGGGCACGCG GGGCGTGGTC ACCGGCCGCG GCGGCACCAG CGTCCTCTAC
GGTCCGGGCA GCTCGGCCTT CTCGTTCACG ATCACCCCGA CCTTCACCTG GGATCGCTAC
TTCCTCCGCG GCGAGTTCGC GACCGTCCAG GCCTACGACG TGACCCCCGG CTTCGGCTTC
GGCCGCGACG GCACCAAGCG CTCGCAGGAG CGCTACCTCG TGGAGACCGG CTTCACCTTC
TGA
 
Protein sequence
MTIKTKLLAA TAVLSTLSIN ALPVLAADMP AAKSAPVIVE EHCKAAISTP TFGGLIKANP 
NPACIVTGLG DIYVGGAVTG FAYTQTNAFG ILSPSAEQDR FGRVDFSNLQ GWIQKADGPL
QFYVHAGLYS IPALGLPLYS AFEQTESLFG PIPVAFGKWQ INDEWSIQAG RMFTNIGSEL
LFTYQNLNIS RGLLFNQENF INHGVQVNYA NGPFAAALAV TDGFYSGELN WVTGFATYKL
NDANTIGING GTHFSDFDAS TRSPRFQFAT INSLQNSSII SVNYTYANGP WIISPYFQYT
NVARKEGYFS PIEGAETWGG TLLAGYTFTD NFALAGRLEY IEQSGTRGVV TGRGGTSVLY
GPGSSAFSFT ITPTFTWDRY FLRGEFATVQ AYDVTPGFGF GRDGTKRSQE RYLVETGFTF