Gene Mmar10_0922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0922 
Symbol 
ID4285254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1020163 
End bp1021374 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content68% 
IMG OID638140390 
ProductHK97 family phage major capsid protein 
Protein accessionYP_756153 
Protein GI114569473 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.255185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAAGG AAACCAAGAT GACCGCCGCG ACCGGTGACA GCCGCGCGGT GATGGGCGAG 
CTGCTGGCTG CCTTCGAACA GTTCAAACAG GCCAATGACC AGCGCCTCGC CGAGATCGAG
ACCCGCGCCG CTGCCGATGT GCTGCTGGAG GACAAGGTCG CCCGCATCGA CAAGGCGCTC
GACAGCCAGA AATCCGCGCT CGACCGTCTC GTCCATGAAG CGGCCCGTCC CGGCCTCGCT
CCGGGTGCCG ACAGGTCGGC GGCCTCGACG GGCTTTGCCG CCTATATGCG CAGCGGCCAG
CTGGCCGAAG GCAAGTCCGC CACGGCCGGC ACGCCCGGTG AGGGCGGTCA TGTCGTGCCG
GCCGAAACCG AGGCCCGGAT CGACCGGCTG CTGGCCGAAG CCTCACCCAT TCGCGCCATC
GCCACGGTGC GCCAGACCGC GTCCGGCACC TTCCGCAAGC CGGTCTCGCG CGGCGGGGCG
GCGACGGGCT GGGTGTCCGA AACGGCGGCC CGCCCGGAAA CCGATGCGCC CAGTCTCGAA
CTGATCGAGT TTCCCGCCGC CGAGCTTTAC GCCATGCCGG CCGCCACCCA GCAATTGCTC
GATGACGCCA TGGTCGATGT CGAGGACTGG CTGGCCGAGG AGGTCCGCGA CGTCTTCGCC
GCCCAGGAAA GCGCCGCCTT TGTCTCCGGC GACGGCATCA ACAAGCCGCG TGGCCTGCTG
GACTACACTG CCGTCGCCGA GGGCACCCAA GCCTGGGGCG AGCTCGGCTA TGTCGCCACC
GGCACGGCGG GCGGGTTTGA CGCGACAGAT CCCGCCGACG CGTTGATCGA TCTCATCTAT
GCACCCAAGA CCGCCTATCG CGCCAAGGGC CGCTTCCTGA TGAACCGCCA GACCGTCTCC
GCCGTGCGCC GCTTCAAGGA TGCCGACGGC AATTATCTCT GGCAGCCGGC GCTGGGCGAG
GGGGCGAGTT CGACCCTGCT CGGCTATCCG GTCACCGAGG CCGAGGACAT GCCCGATATC
GGCACTGACA GCGCCTCGAT CGCTTTTGGT GACTTCGCCC GCGGCTATCT GGTGCTGGAC
CGCCAGGGCG TCGAAGTCCT GCGCGACCCG TTCAGCGCCA AACCCTATGT CCTCTTCTAC
ACCACCAAGC GCGTCGGCGG CGGAGTGCAG GATTTCGAAG CGATCAAGCT GCTGAAGTTT
GGTGTGAGCT GA
 
Protein sequence
MSKETKMTAA TGDSRAVMGE LLAAFEQFKQ ANDQRLAEIE TRAAADVLLE DKVARIDKAL 
DSQKSALDRL VHEAARPGLA PGADRSAAST GFAAYMRSGQ LAEGKSATAG TPGEGGHVVP
AETEARIDRL LAEASPIRAI ATVRQTASGT FRKPVSRGGA ATGWVSETAA RPETDAPSLE
LIEFPAAELY AMPAATQQLL DDAMVDVEDW LAEEVRDVFA AQESAAFVSG DGINKPRGLL
DYTAVAEGTQ AWGELGYVAT GTAGGFDATD PADALIDLIY APKTAYRAKG RFLMNRQTVS
AVRRFKDADG NYLWQPALGE GASSTLLGYP VTEAEDMPDI GTDSASIAFG DFARGYLVLD
RQGVEVLRDP FSAKPYVLFY TTKRVGGGVQ DFEAIKLLKF GVS