Gene Caul_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2112 
Symbol 
ID5899567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2271708 
End bp2273582 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content70% 
IMG OID641562601 
Productsulfotransferase 
Protein accessionYP_001683738 
Protein GI167646075 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.214082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTCA TGGAAGCCGA TCGTCCCATC CTGGCCGCCG AGCCGGGCGC GTCATCGGAC 
GCGGCTTGCG CCTGCGGCTC GGGCCTGGGC TTGAGCCTCT GCTGCCAGCT CGACCTGACC
AAGGCGAACC GCGAACCCGT CTCGCCCGAC TTCGCCTCGA CCCTGTCGCG GATGGCCGAC
GCCTATCGCG ACCGGGAACT GGTCCTGGCC GAGCACATGG CCCACGCCGC GCTGCGCGAG
GCGCCGGCCC ATCGCGACGC CCTGGGCGGG CTCTACAACG TCTGCAAGGA CACCGGCCGG
ATCGCCGCCG CCGAGGCGCT GGTGCGCCGA ATGGCGAAGC TGTACCCGGA CGACCCCATG
GTGCGAATGA ACGCCGCCCT GTTCTTTCGC GGCAAGGGCG CGAAGGGCGA GGCAGAGGCG
CACGGCCGGG CCCTCGTTCG CCTTGCGCCC ACGGCGTCGG CGGCGCACGT GACGATGGGA
CTGGTGTTCA ACGTCACGAA CCAGTTCGTC AGCGCCGAGC ATCACCTTCG GCGGGCCCTG
GACCTGGGCG GTCCGGCCGA CGCCGAGACC CTGGGCGCCC TGGCGATCGC CCTTCGCGGA
CAAGGTCGGT TGGACCTGGC CCGCGACGCC TTCGCCCAGG CCGTGGCGGT CGCGACGCGC
GAGGCCCCGC ACCTGCTGAT CGCCTGGGCC GATCTGGAGG AGGCCGCCAG CCGGTTCGAC
GCCTCCGAGA CCCTGCTGGA CCGGGCTCAG GCCCTGGCGC CGCGCGATCC TCGCATCGCG
CTGGGCCGCG CCACCCTGTC GCGCCGCCGT CAGGCCTTCC AGGAGGCGCT CGACACGCTG
GACGCGCTTG AGGCGCGCGG CCCCAAGGAC CAGGTCGCCG CCCTGGCGAT GAAGGAGCGC
GGGCTGGTGT TCGACGCCCT CGGCCGTCAC GACGAGGCCT TCCTCGCCTT CACGAGCTTC
AAGCAGCGAC ACGGCGCGTG GACAGGCCAT GTCTACGCCG CCGATCACGC GGCCCTCCAG
GCGGCTTGGT TGAAGGACTT CTTCACCGCC GAGCGTACAC CCCACCTGCT CCGCGCCGGC
CCGCGGACCG ACCTGCCGCA GCCGATCTTC GTGGTCGGCT TTCCCCGGTC AGGCACCACC
CTGGTCGAGC AGACCCTGAC CTCGCACCCA GCCATATCGG CCGGCGACGA GTTGCCGATC
ATCAACCGCC TGATCGAACG CCTACCCAGC CTGCTGGGCA GCCTGGGCGC CTATCCCGGC
GCTCTGACCG AACTGTGGGT CGGCGACCGG GCCGGCATGA TCGACAGCCT GCGCGACATC
TACCTCAACG AGGCGATCCG CTTGGGCGCG ACCCGCCCCG GGGCCGCCTG GTTCACCGAC
AAGATGCCGC TGAACGAGAC CCATCTGGGG CTGATCCACC TGCTTTTCCC CCAGTCGCCG
ATCGTCCAAC TGGTGCGCCA CCCCATGGAC GTCGTGCTGT CGGTGTTTTC CAACGGGCTG
AGCCACGGCT TCCACTGCGC CACGAGCCTG GAGAGCGCCG CCCGCCATTT CGCCCTGACC
GCCGATCTGG TGGAGCACTA CAAGACGGTG CTGCCGATGC GTCACCTGGA CGTGCGCTAT
GAGGATATGG TGCGGGACCA GGAGGCCCAG GTGCGGCGAC TGTTCGACTT CATCGGCGAG
CCCTACGATC CGCGCGTTCT GGATTTCCAC GAAAACCGCC GCCCTGCCCG TACCGCCAGC
TACGCCCAGG TGACCGAGAA GCTCTATGAT CGCTCGATGT TCCGCTTCCG CGACTATCGC
AGTCATCTGA CGCCGGTGGA ACCGATCCTG AGACCCTGGA TCGAGAAGCT CGGCTACGGC
GCGGAGGCGG ACTAA
 
Protein sequence
MSVMEADRPI LAAEPGASSD AACACGSGLG LSLCCQLDLT KANREPVSPD FASTLSRMAD 
AYRDRELVLA EHMAHAALRE APAHRDALGG LYNVCKDTGR IAAAEALVRR MAKLYPDDPM
VRMNAALFFR GKGAKGEAEA HGRALVRLAP TASAAHVTMG LVFNVTNQFV SAEHHLRRAL
DLGGPADAET LGALAIALRG QGRLDLARDA FAQAVAVATR EAPHLLIAWA DLEEAASRFD
ASETLLDRAQ ALAPRDPRIA LGRATLSRRR QAFQEALDTL DALEARGPKD QVAALAMKER
GLVFDALGRH DEAFLAFTSF KQRHGAWTGH VYAADHAALQ AAWLKDFFTA ERTPHLLRAG
PRTDLPQPIF VVGFPRSGTT LVEQTLTSHP AISAGDELPI INRLIERLPS LLGSLGAYPG
ALTELWVGDR AGMIDSLRDI YLNEAIRLGA TRPGAAWFTD KMPLNETHLG LIHLLFPQSP
IVQLVRHPMD VVLSVFSNGL SHGFHCATSL ESAARHFALT ADLVEHYKTV LPMRHLDVRY
EDMVRDQEAQ VRRLFDFIGE PYDPRVLDFH ENRRPARTAS YAQVTEKLYD RSMFRFRDYR
SHLTPVEPIL RPWIEKLGYG AEAD