Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1855 |
Symbol | |
ID | 5899310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1978242 |
End bp | 1980419 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562345 |
Product | sulfotransferase |
Protein accession | YP_001683482 |
Protein GI | 167645819 |
COG category | [N] Cell motility [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACTC CGCAGCAAAA CATGACCGTC GAGCAGGCGC TGGCCCAGGC CCATGGCCAC TGGAACGCCG GTCAGGTCGA TCAGGCGGAG CGGCTTTGCC AGCAGGTGCT CGCCCTGTGG CCCGGACAGT CGGACGCCCT ACATCTGATG GGGTTGATGG CTCATGCCTA TGGCAATCTG GACCTGGCGA TCGACCATCT GCGCCATGCC TGCCTGGCGC CGCGCGCACC CGCCCAGTAC CTGAGCAACC TGGCCGAGAT GTGCAGGCAG GCCGGGCGCC TCGCCGAGGC TGAGCAGGCT GCCCGCCGGT CGGTGTCCAT GGACAGCAGC CTGGTGGCCG GGTGGAACAA TCTCGGGATC GTCCTGCAGG AAGCCGGCAA GCTTGAGGAG AGCTTGACCT GCCTCGAGCG CGTGGTCGCG CTGCAGCCCG ACTATGCCGA GGCCCACAAC AACCTGGGCA ACACGCTGAA GCGGCTGGGG CGGCTGGACC AGGCCCGCGC CCGGTACGAG CAGGCCCTCA AGCTCGCGCC GGCCTATGCT GAAGCGCTCA GCAATCTCTC AAACCTGCTC AATGATCTCG GATTGTCTGA CGAGGCGCTC GCCTCGGCGC GTCGCGCGAT CGACGCCAAT CCCCGCCTGT CTGACGCCTA CATCAATGCG GCCGCGGTCG AAGTGGCGCG TGACCGCTAC GATGAGGGCT TGCGCTGGGT CGATGCGCTG CTGGTCTACG CGCCCCTGCA TGCGGGAGCT CTGGGGGTGC GCGCCACGAT CCTGCGCCGC CTGGGCCGAC TGGACGAAGC CCTGGTCGAA GCCCGCCGCG CCCTCGCGAC GGCGCCGGAC AACGGCGAGG CGCTGAATAC GCTGGGTGAG GTGCTGCAGG CGCAGGACAA GATGGACGAG GCGCTGGCGG CCTATGATCG GGCCGCTCAG TCGCTGGGCT TTGCGCCCGA AAAGGCCCTG GTCAATCGCG CCATCCTGCT GATGGAGCGG GGGGACACGG AGGCGGCGAA GGCAGCTTTC GATGACGTGC TGGAGCATTT CCCGCGCTCG GCGTCCGCCT GGTTCAACCG GGCTGACCTG CACCGTTTCG TGCCCGGCGA CCCCGCCATC GGCGCGATGG AGGCCTTGAT CGGCCCCGGG GGCGTCCAGA ACCAGGCCGA TCGCACCGCG CTGCATTTCG CCCTCGGCAA GGCATGGATG GATGTCGGCG ACGCCGAGCG GGCGTTTCGC TATCTCGACG AGGGCAATCG TCAGAAGCGC GCGACCTTCG CCTACGACCC GAACGCCATC GACCGCTGGT TCTCGGACAT CATCGCCGCC TTCCCCTCGG AGATGATCCA ACGGCCCGAG GCCGCGACGC CTGGCAGCGA TCTAGCGGTG TTCGTGATCG GCATGCCGAG GTCCGGAACC ACGCTGGTCG AGCAGATTCT GGCGTCGCAC CCGGACGTTC TGGGCGCGGG CGAAATGACC ACCCTGCAAA ACATCGTGAA CACGGCGGGA GGGTATCCGG CCATCGCGCA ACAGCTGACT CCGGAAAACG AGGCCGCTCT GGGAGGGCTC TATCTGGACG CCGTGCGGCC GCTCGCGGGC GATCATCACC GACTGGTCGA TAAGATGCCG TCCAACTTCC TGTTTGCAGG CCTGATCAAT CGGATCCTGC CGCAGGCGCG GATCATCCAT GTGCGGCGCG ATCCCGCCGA CACCTGCCTG TCAAGCTACA GCAGGCTGTT CTCGCGCGAG CAGCTGTTCT GCTACGATCA GTCGGAGCTG GCGCGTTTCT ATCAGAACTA CGAGCGCCTG ATGGATCACT GGCGCGCGGT GCTTCCGGCC GATCGCTTCA TTGAGGTCCG CTATGAGGAC CTAGTGGACG ATATTGAGCA TGAAGCCCGG CGCCTGACGG ACTTCTGCGG GCTCGACTGG AGCCCGGCGT GCCTCGATTT CCACCAGACC TCGCGGACGA TCCGCACCGC GAGCCTCAAT CAGGTTCGCC GTCCCCTCTA TGCCAGCAGC ATCGGACGCT GGCGTGCGTA TGCCCGCCAG CTTGGACCCT TGCTGACGGG GCTCGGGATC GATCCTGAGG CTGTCGCCGC GCCGACGGTC GGTCGCAAGA CCGCTGCCGG CAAACGCGCG GGCAAGGGCG CCGGAAAGCG CGATCAAAAA CCATCGATCG CCAGTTGA
|
Protein sequence | MNTPQQNMTV EQALAQAHGH WNAGQVDQAE RLCQQVLALW PGQSDALHLM GLMAHAYGNL DLAIDHLRHA CLAPRAPAQY LSNLAEMCRQ AGRLAEAEQA ARRSVSMDSS LVAGWNNLGI VLQEAGKLEE SLTCLERVVA LQPDYAEAHN NLGNTLKRLG RLDQARARYE QALKLAPAYA EALSNLSNLL NDLGLSDEAL ASARRAIDAN PRLSDAYINA AAVEVARDRY DEGLRWVDAL LVYAPLHAGA LGVRATILRR LGRLDEALVE ARRALATAPD NGEALNTLGE VLQAQDKMDE ALAAYDRAAQ SLGFAPEKAL VNRAILLMER GDTEAAKAAF DDVLEHFPRS ASAWFNRADL HRFVPGDPAI GAMEALIGPG GVQNQADRTA LHFALGKAWM DVGDAERAFR YLDEGNRQKR ATFAYDPNAI DRWFSDIIAA FPSEMIQRPE AATPGSDLAV FVIGMPRSGT TLVEQILASH PDVLGAGEMT TLQNIVNTAG GYPAIAQQLT PENEAALGGL YLDAVRPLAG DHHRLVDKMP SNFLFAGLIN RILPQARIIH VRRDPADTCL SSYSRLFSRE QLFCYDQSEL ARFYQNYERL MDHWRAVLPA DRFIEVRYED LVDDIEHEAR RLTDFCGLDW SPACLDFHQT SRTIRTASLN QVRRPLYASS IGRWRAYARQ LGPLLTGLGI DPEAVAAPTV GRKTAAGKRA GKGAGKRDQK PSIAS
|
| |