Gene Information |
Locus tag | Mmar10_2649 |
Symbol | |
ID | 4285980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 2886289 |
End bp | 2888298 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638142148 |
Product | sulfotransferase |
Protein accession | YP_757873 |
Protein GI | 114571193 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
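The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts a single terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2886289
end_bp = 2888298
gene_length_bp = 2010
protein_length_aa = 669


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, excluding one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 2010
print(coding_codons(gene_length_bp))   # 669
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields in the record.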
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGAAG AGACGGACAT CCATGTCCAG GCGCTGACGG CAGCGCAAAC GCAGATGTAT GAAGGCCGCT TTGATGCGGC CCGGGAGACG CTGCAACCGG TTCTGGATGC GTCACCGGAC CATGTCGATG CGCTCTACAT GCAGGCGGTC TGCGCCCGTT ACCTCAAACG CCATGACGAA GCCCGCGCCG CTCTCGAGCG TATCAAGGGC GTCTCGCCAG ATTTTGGCCG TGCCTATCAG GAAGAAGGCC ATCTCCTGCG CGCGCTGGGC GAGGATGATC GCGCTCTGAC TGCCTATCAA CGCGCTTGCC GGTTCAACCC CGCCCTTGTT GCCAGCTGGC GGGCTCAATC AGACCTCTTG CAGGCGGCAG GGCGTCCGGC GGAGGCCAGC AACGCAGCGG CACAGGCGGA ACGGATTGCC GCACTGCCGC GGGACCTGGT CTCGGTCACC CATCTCCTGC ACGAGGGCAA GCTCCTGAAG GCCGAACAGC TGTGTCGGGC CTTTCTCCAG AAGACCCCGC ACCATGTCGA AGCCATGCGA TTGCTGGCGG AAATCGGTTC GCGCTTCGGC GTGCTTGAAG ATGCTGACTT CCTGCTCGAG AGCGCCATCG GTTTTGAACC GGACAACACC CAGCTGCGGC TCGACTACAT CCAGATCCTG CGCAAGCGCC AGAAGTTCGC GGCCGCCCTC GAACAGGCCC GACAGCTCTG GGAGACCGAT CGCGATAATC CGGTCTTCAA ATCCCACTAC GCCATCGAGC GCATGCAGAC CGGCAGCTAT GACGAAGCCC TGACCCTGTT TGAGGAGATA CTCGTCACAC TGCCCGACGA CCCGGCGACG CTGACCTCAC TCGGGCACGC CCAGAAGACA CTTGGCCAGC ATGACGCCGC GGTTGCCAGC TACCGGGCGG CCTTTGCAGC CAAACCTGAC CATGGCGACG CCTGGTACGG GCTGGCCAAT CTGAAGACCT ATCGCTTCAC CGATGAAGAG GTCGCGTCCA TGCAGGCGCT GGAAGCTGGC AGCGATCTGG CTTTCCAGGA CCGGGTCCAT CTCAGCTTTG CCCTCGCCAA GGCCTTCGAG GATCACGAAG ACGTCGCGCA GGCGTTCGAC TTCTACGAAA AGGGCAATAC GCTCAAGCGG GTCCAGACCC GCTACACCAC CGAGCAGATG AAGGCCGAGC TCGATGCCCA GGCCGAAATC TGTGACGCGG CGCTGTTTGC CCGCCAGTCG GGCAAGGGAT GCGCCGATCC TGATCCAATC TTCATTGTTG GACTGCCCAG GGCCGGCTCC ACCCTGCTGG AACAGATCCT GGCCTCGCAC AGCCAGGTCG ACGGCACGTT GGAACTTCCC AACATCCTGG CCCTGTCGCA TCGATTGCGC GGACGACAGC GGTTGAGTGA CAAGACTCGC TATCCTCGGG TGCTCCACGA GCTGGATGCC GGGCAACTGG AAGCGCTGGG CCGGGACTAT ATCGAAAACA CCCGCATCCA TCGTGCCGGT GCGCCGCGCT TCACCGACAA GATGCCAAAC AATTTCCGGC ATATCGGCCT GATCAAACTG ATCCTGCCCA ATGCCCGCAT CATTGATGCC CGCCGGCATC CGATGGCCTG CTGTTTCTCC GGCTTCAAGC AATTATTCGC CGAGGGCCAG GAATTCACCT ACGGGCTCGA GGAAATCGGG CACTATTACC GCAATTATGT TGCGCTGATG GATCATTGGG ATCGGGTTCT TCCCGGCCAG ATCCTGCGCG TGAACTATGA GGACGTCGTG TCTGACCTTG ACGGGCAGGT TCGCCGTATT 
CTCGACTATT GCGGCCTGCC ATTCGAGCAG GCCTGTATCG ACTTCCACGC GACCGAGCGG GCGGTCCGGA CGGCCAGTTC GGAACAGGTC CGCCAGCCGA TCTTCGACGC CGGTGTGGCG CAGTGGAAGA AATTCGAACC CCATCTCGAC CCGCTGAAAA GCGCGCTGGG TAAAGACATA CTGGCCCGAG CGGGACAAGG AACATCATGA
|
Protein sequence | MAEETDIHVQ ALTAAQTQMY EGRFDAARET LQPVLDASPD HVDALYMQAV CARYLKRHDE ARAALERIKG VSPDFGRAYQ EEGHLLRALG EDDRALTAYQ RACRFNPALV ASWRAQSDLL QAAGRPAEAS NAAAQAERIA ALPRDLVSVT HLLHEGKLLK AEQLCRAFLQ KTPHHVEAMR LLAEIGSRFG VLEDADFLLE SAIGFEPDNT QLRLDYIQIL RKRQKFAAAL EQARQLWETD RDNPVFKSHY AIERMQTGSY DEALTLFEEI LVTLPDDPAT LTSLGHAQKT LGQHDAAVAS YRAAFAAKPD HGDAWYGLAN LKTYRFTDEE VASMQALEAG SDLAFQDRVH LSFALAKAFE DHEDVAQAFD FYEKGNTLKR VQTRYTTEQM KAELDAQAEI CDAALFARQS GKGCADPDPI FIVGLPRAGS TLLEQILASH SQVDGTLELP NILALSHRLR GRQRLSDKTR YPRVLHELDA GQLEALGRDY IENTRIHRAG APRFTDKMPN NFRHIGLIKL ILPNARIIDA RRHPMACCFS GFKQLFAEGQ EFTYGLEEIG HYYRNYVALM DHWDRVLPGQ ILRVNYEDVV SDLDGQVRRI LDYCGLPFEQ ACIDFHATER AVRTASSEQV RQPIFDAGVA QWKKFEPHLD PLKSALGKDI LARAGQGTS
|
| |
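The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated is given below. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA), uses the amino-acid assignments of translation table 11, and treats ATG, GTG and TTG as initiation codons that are read as methionine in the first position. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); the amino-acid assignments
# of translation table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the table 11 initiation codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base up to, but not including, the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "GTGGCAGAAG AGACGGACAT CCATGTCCAG"
print(translate(prefix))  # MAEETDIHVQ
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence field to `gc_percent` and `translate` allows the GC content and protein sequence fields to be checked in the same way.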