Gene Caul_1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1855 
Symbol 
ID5899310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1978242 
End bp1980419 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content67% 
IMG OID641562345 
Productsulfotransferase 
Protein accessionYP_001683482 
Protein GI167645819 
COG category[N] Cell motility
[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACTC CGCAGCAAAA CATGACCGTC GAGCAGGCGC TGGCCCAGGC CCATGGCCAC 
TGGAACGCCG GTCAGGTCGA TCAGGCGGAG CGGCTTTGCC AGCAGGTGCT CGCCCTGTGG
CCCGGACAGT CGGACGCCCT ACATCTGATG GGGTTGATGG CTCATGCCTA TGGCAATCTG
GACCTGGCGA TCGACCATCT GCGCCATGCC TGCCTGGCGC CGCGCGCACC CGCCCAGTAC
CTGAGCAACC TGGCCGAGAT GTGCAGGCAG GCCGGGCGCC TCGCCGAGGC TGAGCAGGCT
GCCCGCCGGT CGGTGTCCAT GGACAGCAGC CTGGTGGCCG GGTGGAACAA TCTCGGGATC
GTCCTGCAGG AAGCCGGCAA GCTTGAGGAG AGCTTGACCT GCCTCGAGCG CGTGGTCGCG
CTGCAGCCCG ACTATGCCGA GGCCCACAAC AACCTGGGCA ACACGCTGAA GCGGCTGGGG
CGGCTGGACC AGGCCCGCGC CCGGTACGAG CAGGCCCTCA AGCTCGCGCC GGCCTATGCT
GAAGCGCTCA GCAATCTCTC AAACCTGCTC AATGATCTCG GATTGTCTGA CGAGGCGCTC
GCCTCGGCGC GTCGCGCGAT CGACGCCAAT CCCCGCCTGT CTGACGCCTA CATCAATGCG
GCCGCGGTCG AAGTGGCGCG TGACCGCTAC GATGAGGGCT TGCGCTGGGT CGATGCGCTG
CTGGTCTACG CGCCCCTGCA TGCGGGAGCT CTGGGGGTGC GCGCCACGAT CCTGCGCCGC
CTGGGCCGAC TGGACGAAGC CCTGGTCGAA GCCCGCCGCG CCCTCGCGAC GGCGCCGGAC
AACGGCGAGG CGCTGAATAC GCTGGGTGAG GTGCTGCAGG CGCAGGACAA GATGGACGAG
GCGCTGGCGG CCTATGATCG GGCCGCTCAG TCGCTGGGCT TTGCGCCCGA AAAGGCCCTG
GTCAATCGCG CCATCCTGCT GATGGAGCGG GGGGACACGG AGGCGGCGAA GGCAGCTTTC
GATGACGTGC TGGAGCATTT CCCGCGCTCG GCGTCCGCCT GGTTCAACCG GGCTGACCTG
CACCGTTTCG TGCCCGGCGA CCCCGCCATC GGCGCGATGG AGGCCTTGAT CGGCCCCGGG
GGCGTCCAGA ACCAGGCCGA TCGCACCGCG CTGCATTTCG CCCTCGGCAA GGCATGGATG
GATGTCGGCG ACGCCGAGCG GGCGTTTCGC TATCTCGACG AGGGCAATCG TCAGAAGCGC
GCGACCTTCG CCTACGACCC GAACGCCATC GACCGCTGGT TCTCGGACAT CATCGCCGCC
TTCCCCTCGG AGATGATCCA ACGGCCCGAG GCCGCGACGC CTGGCAGCGA TCTAGCGGTG
TTCGTGATCG GCATGCCGAG GTCCGGAACC ACGCTGGTCG AGCAGATTCT GGCGTCGCAC
CCGGACGTTC TGGGCGCGGG CGAAATGACC ACCCTGCAAA ACATCGTGAA CACGGCGGGA
GGGTATCCGG CCATCGCGCA ACAGCTGACT CCGGAAAACG AGGCCGCTCT GGGAGGGCTC
TATCTGGACG CCGTGCGGCC GCTCGCGGGC GATCATCACC GACTGGTCGA TAAGATGCCG
TCCAACTTCC TGTTTGCAGG CCTGATCAAT CGGATCCTGC CGCAGGCGCG GATCATCCAT
GTGCGGCGCG ATCCCGCCGA CACCTGCCTG TCAAGCTACA GCAGGCTGTT CTCGCGCGAG
CAGCTGTTCT GCTACGATCA GTCGGAGCTG GCGCGTTTCT ATCAGAACTA CGAGCGCCTG
ATGGATCACT GGCGCGCGGT GCTTCCGGCC GATCGCTTCA TTGAGGTCCG CTATGAGGAC
CTAGTGGACG ATATTGAGCA TGAAGCCCGG CGCCTGACGG ACTTCTGCGG GCTCGACTGG
AGCCCGGCGT GCCTCGATTT CCACCAGACC TCGCGGACGA TCCGCACCGC GAGCCTCAAT
CAGGTTCGCC GTCCCCTCTA TGCCAGCAGC ATCGGACGCT GGCGTGCGTA TGCCCGCCAG
CTTGGACCCT TGCTGACGGG GCTCGGGATC GATCCTGAGG CTGTCGCCGC GCCGACGGTC
GGTCGCAAGA CCGCTGCCGG CAAACGCGCG GGCAAGGGCG CCGGAAAGCG CGATCAAAAA
CCATCGATCG CCAGTTGA
 
Protein sequence
MNTPQQNMTV EQALAQAHGH WNAGQVDQAE RLCQQVLALW PGQSDALHLM GLMAHAYGNL 
DLAIDHLRHA CLAPRAPAQY LSNLAEMCRQ AGRLAEAEQA ARRSVSMDSS LVAGWNNLGI
VLQEAGKLEE SLTCLERVVA LQPDYAEAHN NLGNTLKRLG RLDQARARYE QALKLAPAYA
EALSNLSNLL NDLGLSDEAL ASARRAIDAN PRLSDAYINA AAVEVARDRY DEGLRWVDAL
LVYAPLHAGA LGVRATILRR LGRLDEALVE ARRALATAPD NGEALNTLGE VLQAQDKMDE
ALAAYDRAAQ SLGFAPEKAL VNRAILLMER GDTEAAKAAF DDVLEHFPRS ASAWFNRADL
HRFVPGDPAI GAMEALIGPG GVQNQADRTA LHFALGKAWM DVGDAERAFR YLDEGNRQKR
ATFAYDPNAI DRWFSDIIAA FPSEMIQRPE AATPGSDLAV FVIGMPRSGT TLVEQILASH
PDVLGAGEMT TLQNIVNTAG GYPAIAQQLT PENEAALGGL YLDAVRPLAG DHHRLVDKMP
SNFLFAGLIN RILPQARIIH VRRDPADTCL SSYSRLFSRE QLFCYDQSEL ARFYQNYERL
MDHWRAVLPA DRFIEVRYED LVDDIEHEAR RLTDFCGLDW SPACLDFHQT SRTIRTASLN
QVRRPLYASS IGRWRAYARQ LGPLLTGLGI DPEAVAAPTV GRKTAAGKRA GKGAGKRDQK
PSIAS