Gene Mmar10_0688 details


Gene Information

Locus tag: Mmar10_0688
Symbol:
ID: 4284840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 788768
End bp: 790153
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 62%
IMG OID: 638140153
Product: hypothetical protein
Protein accession: YP_755919
Protein GI: 114569239
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
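
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 788768
END_BP = 790153


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1386
print(expected_protein_length(length_bp))  # 461
```

Both values match the Gene Length (1386 bp) and Protein Length (461 aa) fields.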


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.714062
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.621954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATCA ATTCAGCGCT TTTGGCCGGC GCATCCGGCC TGATCTCGAA CTCGAACGCC 
CTCGCCTCGA TTTCGGACAA TATCGCGAAC GTCAACACGG TGGGCTACAA GCGCGCCTCG
GCCGTCTTCA CGCCGCTCTA TGACAGCGAT AGCTCGACCA AGGGCTACTC CGCTGCCGGT
GTGAATTCGG TTGCCCGTCT TGCCATCGGT GAGAGCGGCC TGCTGACGGC CGGCTCCTCG
CCAACCGACC TGGCCATCTC CGGCCGCGGC TTCTTCGTGG TCCGCGACGA TGCCAACGTC
AATTCGTCCG ATCCGGTCTC CTTCACCCGT GCCGGCCGCT TCACGCCGGA CACGAATGGC
TATCTGCGCA ATGACGCCGG CAAGTACCTG TCTGGCTGGC CGGTCGCGGC TGACGGTTCC
GTGCCGCAAA ACCCGTCCGA CCTCAACGCG CTCGAAACCA TCAACCTGTC CTCGATCGGT
GGCGCTGCTG AAGCGACAAC GATCATGGGC ATCAATGCCA ACCTGCAGCA AAGCCAGGCG
ATCTCGGCCG ACGAGGCCAC CTATGATGCG ACTGCTTCCG CCACCAACAT GTCGTCGGGC
ACGGTCACGC CGGACTTCCA GCGCTCGATC CCGTTCTATG ACAGTGTCGG CGGTGTCCGC
ACCCTGACCA TCTCGATGCT CAAGAGCTCG ACGCCAAACC AGTGGCACGC TGAAGTCCAC
ATGGTCCCCG CCACCGATCT CACGACCGGG GCTGGCCTGG TCGACGGCCA GATGCTGACG
GGTACGGTCG CCTTTGACGC CCAGGGCCGT ATTGACAGCG CCAATACCAC GCTGCCGACC
CAGCTCGACT TCCTGTCATC GACCAATGCC GCCGCGCTCG GTGCCACAGA ATTCCAGTGG
GCGGCCGCAA CCGGCATCGA TGCCCAGTCA ATCGCCCTGG ACTTCGGCTC GCCCAACGCA
CCGGGCGGCT TCACGCAGTA TGACAGCCCG TCGGCCCTGC TCTCGACCAA TGTGAACGGA
TCGGCTTTCG GTAACTTCTC GAGCGTCGAT GTTGATGATG ACGGCTTCGT CTTCGCCAAG
TTCAACAATG GCATCGTCCG CAAAATCTAC CAGATCCCGG TCGCGACCTT CGTCAATCCG
GATGGCCTGG AAGCCCAGTC GGGCGGTACC TTCACAGTGA CGCCGGAATC GGGTGCCTAC
ACGCTCAACC CGCCGGGACT GGGTTCGTCC GGCCAGATCG CTGCCTCGAC GCTGGAAAGC
TCGAATGTCG ATCTCGCCAA TGAATTCACC AGCCTGATCA CCACACAGAG GGCCTACTCT
GCCTCGTCGA AGATCATCAC CACCGCTGAT GAAATGCTCG ACGAAGCCAT CCGAATGAAG
CGCTAA
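
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and the GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(blocks: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste every line to check the full record.
first_line = "ATGAGTATCA ATTCAGCGCT TTTGGCCGGC GCATCCGGCC TGATCTCGAA CTCGAACGCC "
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this first line only
```

Run on the complete sequence, the same two helpers give values that can be compared with the Gene Length and GC content fields.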
 
Protein sequence
MSINSALLAG ASGLISNSNA LASISDNIAN VNTVGYKRAS AVFTPLYDSD SSTKGYSAAG 
VNSVARLAIG ESGLLTAGSS PTDLAISGRG FFVVRDDANV NSSDPVSFTR AGRFTPDTNG
YLRNDAGKYL SGWPVAADGS VPQNPSDLNA LETINLSSIG GAAEATTIMG INANLQQSQA
ISADEATYDA TASATNMSSG TVTPDFQRSI PFYDSVGGVR TLTISMLKSS TPNQWHAEVH
MVPATDLTTG AGLVDGQMLT GTVAFDAQGR IDSANTTLPT QLDFLSSTNA AALGATEFQW
AAATGIDAQS IALDFGSPNA PGGFTQYDSP SALLSTNVNG SAFGNFSSVD VDDDGFVFAK
FNNGIVRKIY QIPVATFVNP DGLEAQSGGT FTVTPESGAY TLNPPGLGSS GQIAASTLES
SNVDLANEFT SLITTQRAYS ASSKIITTAD EMLDEAIRMK R
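
The protein sequence is the translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds the codon table by hand, relying on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Because this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First line of the gene sequence from this record.
first_line = "ATGAGTATCA ATTCAGCGCT TTTGGCCGGC GCATCCGGCC TGATCTCGAA CTCGAACGCC"
print(translate(first_line))  # MSINSALLAGASGLISNSNA
```

The output matches the first twenty residues of the protein sequence above (MSINSALLAG ASGLISNSNA).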