Gene Caul_3321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3321 
Symbol 
ID5900776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3601507 
End bp3602685 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID641563827 
Producthypothetical protein 
Protein accessionYP_001684946 
Protein GI167647283 
COG category[S] Function unknown 
COG ID[COG3825] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.866495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCTCC GCTTCTTCTC CGAGCTCCGC GCCGCCAAGG TCCCGGTGTC CTTGCGCGAG 
TACCTCCTGT TGATGGAGGC CCTGGACAAG GATGTCATCG ATCGGCGGAT CGAGGACTTC
TACTTCCTGT CGCGGGCCAG CCTGGTGAAG GACGAGAAGA ACCTCGACAA GTTCGACCGG
GTGTTCGGCC ATGTCTTCAA GGGCCTGGAA ACGGTCGAGG ACGGCCTGAC CGCCGACATC
CCGGCCGAGT GGCTGAAAGC CCTGACCGAG AAGTTCCTGA CCGACGAGGA AAAGGCCCAG
ATCGAGGCCA TGGGCGGCTT CGAGAAGCTG ATGGAGACCC TACAGGAGCG CCTGAAGGAG
CAGAAGGAGC GCCACGAGGG CGGCTCCAAG TGGATTGGCT CGGGCGGCAC CAGCCCCTTC
GGCAACAACG GCTACAATCC CGAAGGCGTC CGCATCGGCC AGGACAAGAG CCGGCACGGC
AAGGCCGTGA AGGTCTGGGA CAAGCGCGAA TACAAGAACC TCGACGACAG CGTCGAACTG
GGCACCCGCA ACATCAAGGT GGCCCTGCGG CGCCTGCGCA AGTTCGCCCG CCAGGGGGCG
CAAGAAGAGC TGGACCTGAA CGGCACCATC CGCGGCACCG CCGAGAAGGG CTATCTGGAC
ATCCAGATGC GCCCCGAGCG GCGCAACACG ATCAAGGTGC TGCTGTTCTT CGACATCGGC
GGCTCGATGG ACGGCCACAT CAAGCTCTGC GAGGAGCTGT TCAGCGCCGC CAAGACCGAG
TTCAAGCACC TGGAATTCTT CTATTTCCAC AACTGCCTGT ACGAGGCGGT GTGGCAGGAC
AATCGCCGCC GCCAGGTCGA GAAGCTGCCG ACCTGGGAGG TGCTCCACAA GTTCCCCCAC
GACTACAAGG TCATCTTCGT GGGCGATGCG ACCATGAGCC CCTACGAGAT CACCTATCCC
GGCGGCAGCG TCGAGCATTG GAATGAGGAG GCCGGGGCCA TCTGGATGAG CCGGGTCACC
GACATCTACC AGAGCGCCGT CTGGCTGAAC CCGACCCCCG AGCGGCACTG GGACTACACC
CAGTCGATCG GGGTGATGAA GACCCTGATG AACGACCGGA TGTTCCCGCT GACCATCGAC
GGCCTGGACA AGGCCATGCG GGAACTGGTG CGGGGCTAG
 
Protein sequence
MFLRFFSELR AAKVPVSLRE YLLLMEALDK DVIDRRIEDF YFLSRASLVK DEKNLDKFDR 
VFGHVFKGLE TVEDGLTADI PAEWLKALTE KFLTDEEKAQ IEAMGGFEKL METLQERLKE
QKERHEGGSK WIGSGGTSPF GNNGYNPEGV RIGQDKSRHG KAVKVWDKRE YKNLDDSVEL
GTRNIKVALR RLRKFARQGA QEELDLNGTI RGTAEKGYLD IQMRPERRNT IKVLLFFDIG
GSMDGHIKLC EELFSAAKTE FKHLEFFYFH NCLYEAVWQD NRRRQVEKLP TWEVLHKFPH
DYKVIFVGDA TMSPYEITYP GGSVEHWNEE AGAIWMSRVT DIYQSAVWLN PTPERHWDYT
QSIGVMKTLM NDRMFPLTID GLDKAMRELV RG