Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2389 |
Symbol | |
ID | 4286521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2600678 |
End bp | 2602591 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638141893 |
Product | glycosyl transferase family protein |
Protein accession | YP_757619 |
Protein GI | 114570939 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.476236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCGAT CCGATTTGAT GACCGATCTT GCCCGTGGGA TGCGCGCCTG GTGGGTCATC GGCCTGTTGG CAGCGCTTTC GGCCCTGGCC GGCGTCTTCA CCCTGCCACC AATCGACCGT GACGAAAGCC GCTATGCCCA GGCCACGGCG CAGATGCTCG AGACCGGCAA TTACATCGAG ATCAATTATC TCGACGAGCC GCGCAACAAG AAGCCGGTCG GCATTTACTG GCTGCAGGCC GCTGCCGTCG CGCTGACCTC TGACGCCGGT GACCGCCAGA TCTGGGCCTA CCGCCTGCCG TCAGTGCTCG GCGCGATCCT GGCCGCGTTG GCCACCTTCT GGGCCGGGCA GCGACTGGTC GGTCGCGAAG CCGCTTTTGC CGGTGCCGCC CTGCTGGCAA CCACCGTCCT GTTGGGCATA GAAGGCGGGA TCGCCAAGAC TGACGGGGTG TTGGTCGGTG TCACAACGCT GGCGATGGCC GCGCTGGCCA ATGCGCGCAG CGGTGATCGG CCCGGCTGGC GGACGGCCTT GCTGTTCTGG TCAGCCATCG GGCTGGGCGT ATTGATCAAG GGGCCCGTCG CCCCGATGGT GGCCGGCGTG TCCGTCCTCA CCCTGGTGGT CTGGGAGCGC AAGATCGCCT GGCTGAAACC GGTCCTGGTC TGGTGGGGAC CGATCCTCAC CGGCCTGATT GTCCTGCCCT GGCTGATCTC GATCCAGCTC GCAACCGATG GAGCTTTCCT GCGCGACGCG CTCGTCGGAG ATCTTGGTCC CAAGCTGGTG TCAGGCCATG AGCGACATGG GGGTCTTCCC GGCTATCATC TTCTGGTTCT TCCTGTGCTC TTCTTCCCGG CGACGCTCTT CCTCATCCCT GGCGCGGGTC GGATGGTTTC GGCGCTGCGA GGCGATGATG ACCGCCTGGC TTCAGCCGCC CGTTTCCTGA TCGCCTGGGC GGTCCCGACC TGGGTCCTGT TCGAGCTCCT GCCGACCAAG CTGCCTCACT ACGTCCTGCC GCTCTATCCG GCGCTGGCAC TGGCGGCAGG CTGGGGGCTG GTCGAGCTGG GCAAGGCCGC GCACTGGCAA CGCTTGGCCG GCTGGGCCCT GTTCGCGATC GGCGCGGGCG TGTTCGCGAT CTTTCTTCCC TATGTCTTCA TCACCTATGG CAATAATGCC AGTTGGGACG CCATCCGCCT CGCCCAGGCC GGCTTTGAGG GCGGCTTCCA GTTGGGACTG GACCCATATG CCGCCGCCTG GGTGTTCGGA TCGGGCGCGC TCTTCCTGGC CCTGTCAGCG GCGACCCTGA CCACCGACCG GTTGCGCCCG GGCGTTCTGG CGCTTGTCTT TGCCGTCCTG TCCGGTTTGG GCTGGCAGGT CGCGGCGCGT TCGGGGGCGT TCGCTGAAGC TTATGCGGTC CGCCTGGCCG ATCAGGTACG TGCCGCCCGG GCCTATTCGG AGACGATCAC CGGCTTGTCG CCCGAGGACA TCGTGACGGC CTCCAGCTTC ACCGAACCCA GCCTCGCCTT CTCGCTGGGT TCGGACACGG TGTTGGGGAC AACCGAAGAA GTCTTGGCCT TTGCCGAGGG CCGGGACGAG CCAACGATGT TGGTGCTGGA CCTGTCACGG GATGCGGAGC TACGGGCCGA TCTGAGAACC GAGGCGCGTT CGGTCTATGA GTTGAGACTG GAAATGATCG CAACCGAATG GCGGCCTGAA TTCTCTTCCC CTGTCCCGCA AGAGCCGCCT TGGATGGCGG CAGACCGCCT TCGCGGCGAG CGGCTCGCCT GGATGAGAGA ATTGGGTGTC TGCCACCACA CGCTTGCATC CGGCACCAAT TACGCGCGTG GAACCAATAC CGTGCTGGTT ATCCTGTTCA CCCGCTGCGC CCCAGAGGAC ACCCCCAATG ACCCGCAAGA TTGA
|
Protein sequence | MVRSDLMTDL ARGMRAWWVI GLLAALSALA GVFTLPPIDR DESRYAQATA QMLETGNYIE INYLDEPRNK KPVGIYWLQA AAVALTSDAG DRQIWAYRLP SVLGAILAAL ATFWAGQRLV GREAAFAGAA LLATTVLLGI EGGIAKTDGV LVGVTTLAMA ALANARSGDR PGWRTALLFW SAIGLGVLIK GPVAPMVAGV SVLTLVVWER KIAWLKPVLV WWGPILTGLI VLPWLISIQL ATDGAFLRDA LVGDLGPKLV SGHERHGGLP GYHLLVLPVL FFPATLFLIP GAGRMVSALR GDDDRLASAA RFLIAWAVPT WVLFELLPTK LPHYVLPLYP ALALAAGWGL VELGKAAHWQ RLAGWALFAI GAGVFAIFLP YVFITYGNNA SWDAIRLAQA GFEGGFQLGL DPYAAAWVFG SGALFLALSA ATLTTDRLRP GVLALVFAVL SGLGWQVAAR SGAFAEAYAV RLADQVRAAR AYSETITGLS PEDIVTASSF TEPSLAFSLG SDTVLGTTEE VLAFAEGRDE PTMLVLDLSR DAELRADLRT EARSVYELRL EMIATEWRPE FSSPVPQEPP WMAADRLRGE RLAWMRELGV CHHTLASGTN YARGTNTVLV ILFTRCAPED TPNDPQD
|
| |