Gene Mext_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1431 
Symbol 
ID5833622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1601926 
End bp1603191 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content70% 
IMG OID641367231 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001638903 
Protein GI163850860 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.263231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA TGCGTTTCGA GACCAAGGCC CCCACCGGCC TGCCCGAGAA CAAGGCGGCG 
ACCTTCGGCA CCGATGCGGT GCTCGACGAG TTTGCCCGCG CCTTCGAGGC GTTCAAGGAG
GCCAACGACG TCCGCCTCTC CGAGATCGAG ACCCGGCTCA CCGCGGATGT GGTGACCGAG
GAGAAGCTCA TCCGCATCGA CGCCGCCCTC GATCAGGCGA AGAACCGCCT CGATCGGATC
AGCCTCGACC GTGCCCGGCC GCCGCTCGGC GGGACGGAGC CGGCGCGCGA CGCCTCCGCC
ACAGAGCACA AGGCGGCCTT CGACCTCTAT GTTCGAGCCG GCGAGAGCGC GGGTCTCAAG
CGACTGGAAG AAAAGGCACT TTCCGCCGGC TCCGGGCCGG ATGGCGGCTA CCTCGTGCCG
CCGACGATCG AGCGCGAGGT GCTGCGTCGG CTCGCCGAGA TCTCGCCGAT CCGCGCTATC
GCCACGGTGC GGGCCGTCTC CGGCGGCCAG TACAAGCGCG CCGTCTCGGT CAACGGTCCC
GCCGCGGGCT GGGTCGCCGA GACCGCGCCC CGTCCGCAGA CCGACACGCC GAACCTGTCC
GAGCTGAGCT TCCCGGCCAT GGAACTCTAC GCCATGCCGG CGGCGACCCA GACGCTGCTC
GACGACGCGG TGCTCGATAT CGATGCGTGG CTCGCCGAGG AAGTCGAGGC GGCCTTCGCC
GAGCAGGAGA GTGTCGCCTT CGTCACGGGC AACGGCGTCG GTCGGCCGAA GGGCTTTCTC
AGCTACGACA CCGTCGCCAA CGCGAACTGG GCTTCGGGCA GGCTCGGCTT CATCGCGACG
GGGGCGGCCG GCGCCTTCCC CGCGAGCAAC CCGAGCGACG TGCTGTTCGA TCTGATCTAC
GCGCTGCGCG CCGGCTACCG CCAGGGTGCG AGCTTCGTGA TGAATCGGCG GGTGCAGAGC
GCGATCCGCA AGTTCAAGGA CGCCGACGGC AACTACCTCT GGCAGCCGCC GCTTGCCGCC
GACCGGGCCG CGACGCTGAT GGGCTTTCCG CTGGTCGAAG CCGAGGCGAT GCCCGACATC
GCCGCCGGCA GCCACGCCAT CGCCTTCGGC AACTTCAAGC GCGGCTACCT CGTCGTGGAC
CGCGTCGGCC TTCGGACCCT GCGCGATCCC TACTCCGCCA AGCCCTACGT GCTGTTCTAC
ACCACCAAGC GCGTCGGCGG CGGGGTGCAG GACTTCGCCG CGATCAAGCT GCTCCGGTTC
GCCTGA
 
Protein sequence
MTEMRFETKA PTGLPENKAA TFGTDAVLDE FARAFEAFKE ANDVRLSEIE TRLTADVVTE 
EKLIRIDAAL DQAKNRLDRI SLDRARPPLG GTEPARDASA TEHKAAFDLY VRAGESAGLK
RLEEKALSAG SGPDGGYLVP PTIEREVLRR LAEISPIRAI ATVRAVSGGQ YKRAVSVNGP
AAGWVAETAP RPQTDTPNLS ELSFPAMELY AMPAATQTLL DDAVLDIDAW LAEEVEAAFA
EQESVAFVTG NGVGRPKGFL SYDTVANANW ASGRLGFIAT GAAGAFPASN PSDVLFDLIY
ALRAGYRQGA SFVMNRRVQS AIRKFKDADG NYLWQPPLAA DRAATLMGFP LVEAEAMPDI
AAGSHAIAFG NFKRGYLVVD RVGLRTLRDP YSAKPYVLFY TTKRVGGGVQ DFAAIKLLRF
A