Gene Mext_1433 details


Gene Information

Locus tag: Mext_1433
Symbol:
ID: 5833624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1603754
End bp: 1604926
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 71%
IMG OID: 641367233
Product: HK97 family phage portal protein
Protein accession: YP_001638905
Protein GI: 163850862
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
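
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive (the record does not say so, but this reading reproduces the listed gene length) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1603754, 1604926)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1173 390
```

Both values agree with the Gene Length and Protein Length fields of the record.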


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.273066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGGAT TTATCGCGCG GCTCGCGAGG GCGGCCGGGT TCGTCCCTGA GACGAAAGCG 
AGCGCGGCTT TTGCGCTCTA CGGCGAGGGA CGAGCGATCT GGACCGCGCG CGATTGCGCG
GCTTTGGCCC GCGAGGGTTT CCAGCGCAAT GCCGTCGTCC ACCGCTCGGT TCGGCTCATC
GCCGAGGCCG CGGCCTCCCT GCCGCTGACG CTGGCCCGCG CCGACGATGC CCATCCGCTG
CTGGACTTGC TCGCCCGGCC GAATCCGCGC GAAGGCGGGA TGCGCTTCCT CGACGGGATC
TATGGGCACC TGCTCGTCTC CGGCAATGCA TACATCGAAG CGGTCGAGAT CGATGGTCGA
CCTCGTGAAC TGTTCTCCCT GCGTCCCGAC CGGATGCAGG TCGTGGCCGG CGCCGACGGC
TGGCCCGCGG CTTACGAGTA CGCCGTCGGG GGGCGCCGAC TCCGCTACCA GCAGACCGGC
GCCGTGCCGC CGATCCTGCA CCTGACGCTG TTCAACCCGC TCGACGACCA TTACGGCCTT
TCGCCGATGG AGGCGGCGGC GGTCCCGCTC GACATCCACA ACGCGGCCGG CGCCTGGAAC
AAGGCCCTGC TCGACAACGC CGCCCGCCCG TCCGGCGCTC TGGTCTTCGC GCCCTCGACC
GGCGCCGCCT TGAGCGACAC GCAGTTCACG CGGCTCAAGG CCGAGCTGGA AACGAGCTAC
CAGGGCAGCG CCAATGCCGG CCGGCCGCTC CTCCTCGATG GCGGGCTCGA TTGGCGCCCG
CTCTCGCTCT CACCGAAGGA GATGGACTTC GTCGAGGCGA AGGCCGCCGC TGCCAGGGAG
ATCGCGCTCG CCTTCGGCGT GCCGCCGCTC TTGCTCGGTC TTCCCGGCGA CAACACCCAC
GCGAATTACG CCGAAGCCAA CCGTGCCTTC TACCGTCAGA CGGTGATCCC GCTGGTGCGC
CGCACTGCCG ATTCCCTGGC GCGCTGGCTG GAGCCCGCCT TCGGCCCCGC GCGGTTGGAG
CCGGATCTCG ACGCGGTCGA AGCGCTGGCG ACCGAGCGCG AGTCGCTCTG GCGCCGGGTG
CAAGGCGCGG ACTTCCTGTC GGTCGCCGAG AAGCGCGAGG CCGTCGGCTA CCCTCCCCAG
AGTCCGGGGC AAGGCACGGG CTCTCCGGCC TGA
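
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do it, assuming GC content means the fraction of G and C bases over the full sequence length; the record does not state how its value was calculated or rounded. The helpers `clean_sequence` and `gc_percent` are hypothetical names. The example runs on the first line of the listing only; pasting in all lines would give the figure to compare against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to block the sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCCTGGAT TTATCGCGCG GCTCGCGAGG GCGGCCGGGT TCGTCCCTGA GACGAAAGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```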
 
Protein sequence
MPGFIARLAR AAGFVPETKA SAAFALYGEG RAIWTARDCA ALAREGFQRN AVVHRSVRLI 
AEAAASLPLT LARADDAHPL LDLLARPNPR EGGMRFLDGI YGHLLVSGNA YIEAVEIDGR
PRELFSLRPD RMQVVAGADG WPAAYEYAVG GRRLRYQQTG AVPPILHLTL FNPLDDHYGL
SPMEAAAVPL DIHNAAGAWN KALLDNAARP SGALVFAPST GAALSDTQFT RLKAELETSY
QGSANAGRPL LLDGGLDWRP LSLSPKEMDF VEAKAAAARE IALAFGVPPL LLGLPGDNTH
ANYAEANRAF YRQTVIPLVR RTADSLARWL EPAFGPARLE PDLDAVEALA TERESLWRRV
QGADFLSVAE KREAVGYPPQ SPGQGTGSPA
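
The protein sequence follows from the gene sequence under translation table 11. As a minimal sketch: table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. Since this gene starts with ATG, the standard assignments are enough here. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes the input is an in-frame coding sequence.

```python
from itertools import product

# Codon assignments in TCAG order, shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from the listing above.
print(translate("ATGCCTGGAT TTATCGCGCG GCTCGCGAGG GCGGCCGGGT TCGTCCCTGA GACGAAAGCG"))
print(translate("AGTCCGGGGC AAGGCACGGG CTCTCCGGCC TGA"))
```

The two calls return `MPGFIARLARAAGFVPETKA` and `SPGQGTGSPA`, matching the first 20 and last 10 residues of the protein sequence above.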