Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CCV52592_1149 |
Symbol | |
ID | 5407234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter curvus 525.92 |
Kingdom | Bacteria |
Replicon accession | NC_009715 |
Strand | + |
Start bp | 937803 |
End bp | 939623 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640872392 |
Product | arylsulfotransferase |
Protein accession | YP_001408212 |
Protein GI | 154174611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG AGTTGATTTC TTTTGCAGTT GCTGCGACCT TGTGCGCGGC TTTGACTCCG GCTTTAGCCG CAGGCGGCGC GAGCGGTCCG AAAACTTACC GAGGCGTAGG GCAGATCGGC GCAGTAGTCA TGAATCCTTA CAAAGTCGCC CCTCTGACCG CCGTCATAAA AAGCGCAGGC TACACGCTCT CAGACATCAA AGTAACCGTA AAAGCCAAAA AAGACGGCGT GGATATCAGC TATGACGTAT CCGATCAAAA AGCGCTTCAG CATGGTGGTA TTCCCGTTTG GGGCTTGTAT CCGGACTATG TGAACGAGGT CGCCGTGAGC TATAAGAAAA ATGGAGAGCC GGTGAGCGAA ACGTATAAAA TTTACGCTCC AGCCGTTGCG GTGTATGGCT CGGGGACGAA TCAAACACAA GCACTTCCGA CAGCCGTGGT GACAAAGGAG AATCCGAAAT TTCATAAAAA TTTATACCTG ATGAATCACC TAAGCTCGAC TCTGCCAAAC GCAGCTCAGG TGACGTGGAA CAACCCGGCT GGCGGCGCGC TGGAGTGGGA TTATGAAAGC TACGTTTGGA TAGTCGATGG CAAGGGCGAT ATCAGGTGGT ATCTAAAAAC GGATGAATTC AGAGACACCA ACAATATCAT GAAAAAGGGC AATATGATGG GCTTTGACCA GACCAAGGAC GGTGACATCG TCTGGGGTCA GAGTCAGACC TACAACCGCT TTGACCTCAT GGGCCGCAAA ATTTTCCAAC GCGAACTGCC AAAAAGCTAC ATCGACTTCT CTCATCATAT GGAAGAGACC GCCAAAGGCA CGTTTTTGAT GCGCGTAGCC TCTGGTGATT ATAAACGCAA AGATGGCAAA AACGTCCGCA CCGTGCGCGA CGTGATCATC GAGCTTAACA AGGACGGGAA CGTGCTTGAC GAGTGGAAGA TAATGGAAAT TCTAGATCCC TATCGAGCGA CGAATCTGCT CGTGCTAGAT CAAGGCGCAG TCTGCTTAAA CGTCGATGCG AGCAAAGCGG GGCAAACAGC CAGCAAAGAA GAGCTTGAAG ACGACAAAGC TCCTTGGGGC GATGTCACCG GAGTGGGCGC AGGACGCAAT TGGGCGCATA TAAATTCCGC CAACTACGAC CCTGCCGACG ATAGTATCAT CATCTCGATC CGCCACCAAA GCGCAGTCGT CAAAATCGGA CGCGACAAAA AGGTCAAATG GATATTATCT TCGCCTGAGG GCTGGAACAA AGAGCTATCC GCCAAGCTGT TAAAGCCCAT AGACGCAAAC GGCAAGAAAA TAGACTGCGG CGAGAGCGGC TCGAAGTGCC CGGGATATAC GAGCGAAAAG GGCGGATTTG ACTACACTTG GACGCAGCAC ACGGCGTTTA AAGTAAATGA AAAAAGCAAG GGCGACAAGA TCCATGTGAG CGTCTTTGAC AACGGCGATA CACGCGGTAT GGAGCAGCCT GCATTACCAA ATATGAAATA CTCTCGCGCC GTCGAATATC TCATAGATGA GAAAAATATG ACCGTGCAAC AAGTCTGGGA ATACGGCAAG GAGCGCGGCT TTGAGTGGTA CAGCCCGATA ACTTCAGTCG TGGAGTATCA GCCATCGCGC GACACGATGT TTGTGTATTC TGCGACTGCG GGAATGGGCG ATGTAAAGGC GTTTAGAAGG GGCGAGGCGA AGCTCACGCC GTTTTTAGAA GAGATAAAAT ACGGCACGAA AGAGGTCGAA TTCGAGATGA AATTTATAAA CTCGAACACG ATCGGCTATC GCTCGCTGCC AATCGATCTG CAAAAGGCGT TTAAAAAATA A
|
Protein sequence | MKKELISFAV AATLCAALTP ALAAGGASGP KTYRGVGQIG AVVMNPYKVA PLTAVIKSAG YTLSDIKVTV KAKKDGVDIS YDVSDQKALQ HGGIPVWGLY PDYVNEVAVS YKKNGEPVSE TYKIYAPAVA VYGSGTNQTQ ALPTAVVTKE NPKFHKNLYL MNHLSSTLPN AAQVTWNNPA GGALEWDYES YVWIVDGKGD IRWYLKTDEF RDTNNIMKKG NMMGFDQTKD GDIVWGQSQT YNRFDLMGRK IFQRELPKSY IDFSHHMEET AKGTFLMRVA SGDYKRKDGK NVRTVRDVII ELNKDGNVLD EWKIMEILDP YRATNLLVLD QGAVCLNVDA SKAGQTASKE ELEDDKAPWG DVTGVGAGRN WAHINSANYD PADDSIIISI RHQSAVVKIG RDKKVKWILS SPEGWNKELS AKLLKPIDAN GKKIDCGESG SKCPGYTSEK GGFDYTWTQH TAFKVNEKSK GDKIHVSVFD NGDTRGMEQP ALPNMKYSRA VEYLIDEKNM TVQQVWEYGK ERGFEWYSPI TSVVEYQPSR DTMFVYSATA GMGDVKAFRR GEAKLTPFLE EIKYGTKEVE FEMKFINSNT IGYRSLPIDL QKAFKK
|
| |