Gene Pmen_3972 details

Gene Information

Locus tag: Pmen_3972
Symbol:
ID: 5110195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas mendocina ymp
Kingdom: Bacteria
Replicon accession: NC_009439
Strand:
Start bp: 4351907
End bp: 4353142
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 61%
IMG OID: 640505235
Product: HK97 family phage portal protein
Protein accession: YP_001189451
Protein GI: 146308986
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
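
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names gene_span and expected_protein_length are hypothetical helpers, not part of any database API.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_length_bp // 3 - 1


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
span = gene_span(4351907, 4353142)
print(span)                           # 1236
print(expected_protein_length(span))  # 411
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.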


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.25148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAGC TGGAGCAGCA GCTGGCTGCC GTCAACAATC AGGATCGGGC GCCGGTATCT 
ACCAGTAACC GCGAGGGCAT TCTCGACCTG TTCAACGTCA GCCCGAGTCA TGCCGGGCCT
GTGGTCAACG CCACCACCGC CATGCGCAGC TCGGCTGTCT ATGCCAGTGT GGCGCTCATT
GCCGGCGCGA TTGCGAGCAT GCCAAAGCCC ATCTTCCGGC GCACGGACGG GGGGCGGGAG
CAGACCGAAC ATCCGTACTG GTGGCTGCTC AATGAACAGG CTACGCCTTG TTTTTCGGCC
TACGCCTTCT GGGAATACTT GGTCAGCTGC AAGCTGCTGC GCGGCGATTC TTATGGCTAC
ATCGTGCGTA ACGGCGCCGG AATACCCATC GAGCTGGTGC CCGTGCCCTG GGCCAATGTG
ATTGCCGAGA AGCGTGCGCG GCGGCTGGTG TACTTCATCG AGCTGGATGG TGAATACATC
GGCGTTGATC AGGACGACAT GCTGCATTTC CCCAGCCTCG GGTTTGATGG CGTCAAGAGT
CCCTCCGTGA TCAAGCTGGC CGCCAAGCAG GCTGTGGGGG TTTCGTTGGC TGCTGAAGAG
TACGCCGGCC GCTTCTTCAG CAACGGCGCG CGACCTGACT TCGCCCTGAA GCACCCAGGA
AATCCGACCA GGGAGCAGGT CAACCTGCTG CGCGAAATGT GGGCAGAGCG CCACCAGGGC
GTTGGGCAAA GTCACCTGCC AGCGGTGCTG ACCGGCGGCA TGGATGTGGC TGAGCTCACG
ATGTCCGCCG AGGACTCCCA GCTGTTGGAG ACGCGGAAGT GGCAGGTGAT CGACATTGCT
CGCGCCTTTG GCGTGCCGCC GCACATGATC GGTGAGACGG AGAAGACCAG CGCCTGGGGT
ACTGGCATCG AGCAGATGTC CATCGGCTTC GTCCGCTTCA CACTGAACCG CCACCTGAGG
CCTATCGAGC AGGAGCTCAA CCGCAAGCTT TGGCCAAACA GCCCGCGCTA CTTCGTGGAG
TTCAACCGCG AAGGCCTGCT GGCCGGCGAT AGCAAGGCCG AGGCTGAATA CTTCACCAAG
GCCCTGGGCG GCCCCGGTAA TCGGGGTTAC ATGACGGTGA ACGAAGTTCG CCGCATCAAG
AATCTGCCGC CTATCGAAGG TGGTGATGTC CTCTACACAC CGGAGAGCAC CAGCAATGCG
CAACCGCCTG CTGAACCTGA TCAAGGACAA CCTTAA
 
Protein sequence
MHELEQQLAA VNNQDRAPVS TSNREGILDL FNVSPSHAGP VVNATTAMRS SAVYASVALI 
AGAIASMPKP IFRRTDGGRE QTEHPYWWLL NEQATPCFSA YAFWEYLVSC KLLRGDSYGY
IVRNGAGIPI ELVPVPWANV IAEKRARRLV YFIELDGEYI GVDQDDMLHF PSLGFDGVKS
PSVIKLAAKQ AVGVSLAAEE YAGRFFSNGA RPDFALKHPG NPTREQVNLL REMWAERHQG
VGQSHLPAVL TGGMDVAELT MSAEDSQLLE TRKWQVIDIA RAFGVPPHMI GETEKTSAWG
TGIEQMSIGF VRFTLNRHLR PIEQELNRKL WPNSPRYFVE FNREGLLAGD SKAEAEYFTK
ALGGPGNRGY MTVNEVRRIK NLPPIEGGDV LYTPESTSNA QPPAEPDQGQ P
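
The relationship between the gene sequence and the protein sequence can be spot-checked by translating individual 60-base lines, which are all in frame because 60 is a multiple of 3. The following is a minimal sketch, not the method used to produce this record. It builds the codon table by hand instead of using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order (standard code;
# table 11 uses the same assignments). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCACGAGC TGGAGCAGCA GCTGGCTGCC GTCAACAATC AGGATCGGGC GCCGGTATCT"
last_line = "CAACCGCCTG CTGAACCTGA TCAAGGACAA CCTTAA"

print(translate(first_line))  # MHELEQQLAAVNNQDRAPVS
print(translate(last_line))   # QPPAEPDQGQP
```

The two outputs correspond to the first 20 and the last 11 residues of the protein sequence listed above, and the final TAA codon is the stop.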