Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1051 |
Symbol | |
ID | 4285332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1147859 |
End bp | 1149469 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140522 |
Product | sulfotransferase |
Protein accession | YP_756282 |
Protein GI | 114569602 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCTC AAACCGAATC GCAACTTGAA TCGGCAGCGG CGGCATTGGG CGCCCGAGAC TATCGCCAAG CCCACCGGCT GTGCATGTCG GTGTTGCAGA CCCAGGGGCC GAATGCCGAG GCGTTTTTCC TGCTCGGATT GCTCACGGCC GATCACGATA ATCACGCCAA GGCGGTGGAT ATATTGGACA GGGCAATCGC CCTGGACAGC GGGGACTCCC GATTTCATGC CCATCGCGCC AAGTCATTGC TGGCGCTTCA ACGCCGCGAC GCGGCGGCTG ATGCGGCGCA AAAAGCGGCG GAATTGGGTC CCCGCGATGC GTTGACCTTT GACACGATCG GCGTGGTCTT CACGCGTCTC GGCGATCATG CCTCGGCCAT CGGACTGTTC GAAGCGGCGG TCAGGCAGGA CGGCCAGCGG GCGGCGTATC ACTACAATCT CGCCGCGTCG CGCCAGTTCG CTGGCGCATT CGATCTCGCT GCCGCTGGCT ATGAGCGTAC GCTCGAGCTG GAACCCGGCC ACGTCAAGGC GTTGTCAGCG GTGGTCGGCT TGCGACGCCA GACCGAGGCC GACAACCGGC TCGATGCGCT GGAATCGGCG TTCAAGGCGC GAGACAAAAA TGATCAGGAA GCCCAGCTGC ATCTCGGCCA TGCCATTGCC AAGACGCTTG AAGACCTGGG TCGGCACGAC GAGGCGCTCG ACTGGCTGGG CCGCGCCAAG GCGGTTGTCA GTGCGGTACG CCAGTACGAC GCCCGGGTCG ATGCCGATCT GTTTGCTGCG GCGGCCCGCA CGAGCGAGAC GAGCCCCAGC CCGGCCGCGC CCGGCTGGGA CAGCGACCGG CCAGTGTTCA TTGTGGGGCT TCCCCGGACC GGGACGACAC TGGTCGACCG GATTCTGGCG GCGCATCCCG CGGTTCGACC GCTGGGTGAG TTGTCGAACT TTGCCCTCCT GGCGAAGCAG ATGGCCGGGA CGCCAGGCCC GTATGTCATG GATGCGGCGA CGATCGGGGC GACAACGTCC ATCGACCCGA AAGCCCTTGG CCAGGCCTAT GAGGCCAGCG TGGCCGGCCT CGCCGGTGAC GCCGCGCGCT ATACCGACAA GATGCCGCTC AATATTGTCT GGGCCGGGCA CATCCATCGC GCTTTGCCGA ATGCCCGCAT CATCTGTCTG CGCCGCCACC CGCTCGACGC CTGTCTCAGC AATTACCGCC AGCTCTTCGC GACCAGCTTT TCCTATTACA ATTACGCCTA CGAGCTGACC GACTGCGCCC GCTACTATCT CGAGTTTGAC CGGCTGCGAG GCCATTGGGC GGCCAGCCTT CCGGCCGACC GCTATACCGA AGTCGCTTAT GAGGACATTG TCGGTGACCT GGAGGGCGAG GCAAGGCGCC TGATCGAACA TTGCGGCCTT GACTGGGACC CGGCCTGCCT CGACTTCCAC AACCAGGCCG GCAGTGTGGC AACCGCGAGC TCGGTCCAGG TCCGCCAGCC GCTCTATTCC AGCTCAATCG GTCGTTGGCA ACGGCACGCC GAAGCGCTGA TGCCGGTGCG GTCCATTCTC CGCGATGGCG GTGTGATCGA CGCGGACGGA AACTGGCTGC GCGACACCTG A
|
Protein sequence | MKPQTESQLE SAAAALGARD YRQAHRLCMS VLQTQGPNAE AFFLLGLLTA DHDNHAKAVD ILDRAIALDS GDSRFHAHRA KSLLALQRRD AAADAAQKAA ELGPRDALTF DTIGVVFTRL GDHASAIGLF EAAVRQDGQR AAYHYNLAAS RQFAGAFDLA AAGYERTLEL EPGHVKALSA VVGLRRQTEA DNRLDALESA FKARDKNDQE AQLHLGHAIA KTLEDLGRHD EALDWLGRAK AVVSAVRQYD ARVDADLFAA AARTSETSPS PAAPGWDSDR PVFIVGLPRT GTTLVDRILA AHPAVRPLGE LSNFALLAKQ MAGTPGPYVM DAATIGATTS IDPKALGQAY EASVAGLAGD AARYTDKMPL NIVWAGHIHR ALPNARIICL RRHPLDACLS NYRQLFATSF SYYNYAYELT DCARYYLEFD RLRGHWAASL PADRYTEVAY EDIVGDLEGE ARRLIEHCGL DWDPACLDFH NQAGSVATAS SVQVRQPLYS SSIGRWQRHA EALMPVRSIL RDGGVIDADG NWLRDT
|
| |