Gene Mmar10_0919 details


Gene Information

Locus tag: Mmar10_0919
Symbol:
ID: 4285251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1017978
End bp: 1019033
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 70%
IMG OID: 638140387
Product: HK97 family phage portal protein
Protein accession: YP_756150
Protein GI: 114569470
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
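
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1017978
end_bp = 1019033
gene_length_bp = 1056
protein_length_aa = 351


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions, the 1056 bp span corresponds to 352 codons, of which the last is the stop codon, giving the listed 351 aa.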


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.38293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCACT GGCTCGACCG GCTGTTTGGC CGGGAGGAAA AATCGGCTCT GTCCAGCCGT 
CTGATTGCCC TGACGGCGGG GACGCGGTCG GCCTGGTCGC CGCGCGCGCT GCCGGCGATG
ATGCAGGCCG GCTATGCCCG CAATGCCATT GTCCATCGCT GTGTCCGGCT GACGGCGGAG
GCGGCGGCCT CGGTGCCGCT CGCCGCGTCG GACGGTGCCA TCACCCGCCT GCTCGACATG
CCCAATCCGG ACCAGTCCGG GCCGGAGCTG TGGGAAAGTC TCTATGGTTA TCTACAGCTG
GCGGGAAATG CCTATCTGGA ACTTGCCAGC CTGGCCGATG AACCCCGCGC GCTACACCTT
CTGCGGCCGG ATCGTATGCG TGTGCTGGCC GGGCCGACCG GCTGGCCGGA CGGGTGGGAA
TACGCCGCCG CCGGCCGGAA GCGGCGTTTC CAGCGGGACC GGGTCTCGGG TCGCAGCCCG
GTTTTCCACA TCCGCCTCTT TCATCCCGGC GATGATCATT ACGGCCTGTC ACCGCTGGAG
GCGGTCGGCC GGGCGCTGGA TCTTCATACT GCCGGCTCCG ATTGGGCGCG CGCCCTGCTC
GACAATGCGG CGCGGCCGTC CGGGGCGCTG GTCTTCAAGG GCGAAAACGG CCAGCTCTCG
CCGGACCAGT TCGAACGCCT GAAGGCCGAG CTGGAAGCCA GCCATACCGG CGCCGCCAAT
GCCGGCCGGC CGCTGCTGCT GGAAGGCGGG CTCGACTGGA CGCCGATGGC GCTCAGCCCG
GCCGAGATGG ATTTCACCAC CGCCCGCCGC GAAGCCGCCC GCGAGATCGC GCTGGGGCTC
GGCGTGCCGC CGCTTCTGCT GGGCCTGCCC GGCGACAATA CCCACGCCAA TTACGCCGAA
GCCAATGCCG CCTTCCTGCG CCAGACCGTG CGTCCGCTGG TGATGAAAAT GGCCCGCGCC
CTGAGCGTCT GGCTGCGGCC CTGGTCTGTG CCTGACCTCG AGATACGGCC GGACTTTGCC
GCGCTCGAGG AAGCCGGGGA GGCGCGCCAT GCGTGA
 
Protein sequence
MPHWLDRLFG REEKSALSSR LIALTAGTRS AWSPRALPAM MQAGYARNAI VHRCVRLTAE 
AAASVPLAAS DGAITRLLDM PNPDQSGPEL WESLYGYLQL AGNAYLELAS LADEPRALHL
LRPDRMRVLA GPTGWPDGWE YAAAGRKRRF QRDRVSGRSP VFHIRLFHPG DDHYGLSPLE
AVGRALDLHT AGSDWARALL DNAARPSGAL VFKGENGQLS PDQFERLKAE LEASHTGAAN
AGRPLLLEGG LDWTPMALSP AEMDFTTARR EAAREIALGL GVPPLLLGLP GDNTHANYAE
ANAAFLRQTV RPLVMKMARA LSVWLRPWSV PDLEIRPDFA ALEEAGEARH A
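
The relationship between the two sequences above can be illustrated with a short script. This is a sketch rather than the method used to build the record: it assumes the gene sequence is given in the coding orientation, strips the 10-base grouping before processing, and uses the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGCCCCACT GGCTCGACCG GCTGTTTGGC"
print(translate(first_block))  # MPHWLDRLFG
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and GC content.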