Gene Mmar10_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1051 
Symbol 
ID4285332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1147859 
End bp1149469 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content65% 
IMG OID638140522 
Productsulfotransferase 
Protein accessionYP_756282 
Protein GI114569602 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTC AAACCGAATC GCAACTTGAA TCGGCAGCGG CGGCATTGGG CGCCCGAGAC 
TATCGCCAAG CCCACCGGCT GTGCATGTCG GTGTTGCAGA CCCAGGGGCC GAATGCCGAG
GCGTTTTTCC TGCTCGGATT GCTCACGGCC GATCACGATA ATCACGCCAA GGCGGTGGAT
ATATTGGACA GGGCAATCGC CCTGGACAGC GGGGACTCCC GATTTCATGC CCATCGCGCC
AAGTCATTGC TGGCGCTTCA ACGCCGCGAC GCGGCGGCTG ATGCGGCGCA AAAAGCGGCG
GAATTGGGTC CCCGCGATGC GTTGACCTTT GACACGATCG GCGTGGTCTT CACGCGTCTC
GGCGATCATG CCTCGGCCAT CGGACTGTTC GAAGCGGCGG TCAGGCAGGA CGGCCAGCGG
GCGGCGTATC ACTACAATCT CGCCGCGTCG CGCCAGTTCG CTGGCGCATT CGATCTCGCT
GCCGCTGGCT ATGAGCGTAC GCTCGAGCTG GAACCCGGCC ACGTCAAGGC GTTGTCAGCG
GTGGTCGGCT TGCGACGCCA GACCGAGGCC GACAACCGGC TCGATGCGCT GGAATCGGCG
TTCAAGGCGC GAGACAAAAA TGATCAGGAA GCCCAGCTGC ATCTCGGCCA TGCCATTGCC
AAGACGCTTG AAGACCTGGG TCGGCACGAC GAGGCGCTCG ACTGGCTGGG CCGCGCCAAG
GCGGTTGTCA GTGCGGTACG CCAGTACGAC GCCCGGGTCG ATGCCGATCT GTTTGCTGCG
GCGGCCCGCA CGAGCGAGAC GAGCCCCAGC CCGGCCGCGC CCGGCTGGGA CAGCGACCGG
CCAGTGTTCA TTGTGGGGCT TCCCCGGACC GGGACGACAC TGGTCGACCG GATTCTGGCG
GCGCATCCCG CGGTTCGACC GCTGGGTGAG TTGTCGAACT TTGCCCTCCT GGCGAAGCAG
ATGGCCGGGA CGCCAGGCCC GTATGTCATG GATGCGGCGA CGATCGGGGC GACAACGTCC
ATCGACCCGA AAGCCCTTGG CCAGGCCTAT GAGGCCAGCG TGGCCGGCCT CGCCGGTGAC
GCCGCGCGCT ATACCGACAA GATGCCGCTC AATATTGTCT GGGCCGGGCA CATCCATCGC
GCTTTGCCGA ATGCCCGCAT CATCTGTCTG CGCCGCCACC CGCTCGACGC CTGTCTCAGC
AATTACCGCC AGCTCTTCGC GACCAGCTTT TCCTATTACA ATTACGCCTA CGAGCTGACC
GACTGCGCCC GCTACTATCT CGAGTTTGAC CGGCTGCGAG GCCATTGGGC GGCCAGCCTT
CCGGCCGACC GCTATACCGA AGTCGCTTAT GAGGACATTG TCGGTGACCT GGAGGGCGAG
GCAAGGCGCC TGATCGAACA TTGCGGCCTT GACTGGGACC CGGCCTGCCT CGACTTCCAC
AACCAGGCCG GCAGTGTGGC AACCGCGAGC TCGGTCCAGG TCCGCCAGCC GCTCTATTCC
AGCTCAATCG GTCGTTGGCA ACGGCACGCC GAAGCGCTGA TGCCGGTGCG GTCCATTCTC
CGCGATGGCG GTGTGATCGA CGCGGACGGA AACTGGCTGC GCGACACCTG A
 
Protein sequence
MKPQTESQLE SAAAALGARD YRQAHRLCMS VLQTQGPNAE AFFLLGLLTA DHDNHAKAVD 
ILDRAIALDS GDSRFHAHRA KSLLALQRRD AAADAAQKAA ELGPRDALTF DTIGVVFTRL
GDHASAIGLF EAAVRQDGQR AAYHYNLAAS RQFAGAFDLA AAGYERTLEL EPGHVKALSA
VVGLRRQTEA DNRLDALESA FKARDKNDQE AQLHLGHAIA KTLEDLGRHD EALDWLGRAK
AVVSAVRQYD ARVDADLFAA AARTSETSPS PAAPGWDSDR PVFIVGLPRT GTTLVDRILA
AHPAVRPLGE LSNFALLAKQ MAGTPGPYVM DAATIGATTS IDPKALGQAY EASVAGLAGD
AARYTDKMPL NIVWAGHIHR ALPNARIICL RRHPLDACLS NYRQLFATSF SYYNYAYELT
DCARYYLEFD RLRGHWAASL PADRYTEVAY EDIVGDLEGE ARRLIEHCGL DWDPACLDFH
NQAGSVATAS SVQVRQPLYS SSIGRWQRHA EALMPVRSIL RDGGVIDADG NWLRDT