Gene Caul_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2354 
Symbol 
ID5899809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2552087 
End bp2553232 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID641562845 
Producthypothetical protein 
Protein accessionYP_001683979 
Protein GI167646316 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAGA CCCGATCCGT TCACCCGTGC GCGCTGGACA GCCAGGACGT CGAGACATGG 
CGCGCCATGG CCTCGGCGCA TCCCGAGTTC CGCAACCCTC TGCTCGGCCC GGATTTCGCC
CAGGCGGTCG GCGCCCTTCG GCCCGATGCG CGCGTGGCGA TCTTTCGGCG CCACGGCCGC
GTGCTGGGCT ACCTGCCCTA TCATCGCCGG GCCTGCGGCC AAGCCTGGCC GATCGGCGCG
CCGTTGTCGG ACTATCACGC CCTGGTCGGC GTCGCCGACG CCGGGCTCGA CGGCCGTGAC
GCCCTCGCCG CCGCCGGATT GTCGGCGTTC CGCTTCGGCG GCCTGGTCGA TCCGTTCGAG
GTGTTCGGCC CAGGCGCCGA TCAGGTCGGC CATGTGATCG CGCCGGCCGA CGGCCCCGAG
GCCTATCTGG AGCAGGTCCG GGCGGCCAAT CCCAAGAAGA TCAAGAACTA CCGCCGGCTG
GGCGCCAAGC TGGAACGCGA GTGCGGCGCC GTGCGCCTAG TCGCCGACGA CCGGTCACGA
CCGGCTTTCG ACCAGTTGAT CGCCTGGAAG CGCGAGCAGC TCATGCGCAC CGGAACCCAC
GACTTCCTGG GCGCCGATTG GTCGCTCGAC CTAGTGACCC GGCTATTCGA AGGCCAGCAG
GGCGAGCTTC GCGGCCTGAT GATCTGCCTC TATGCCGGCG ACACCCTGGT GGCAGGGCAT
TTCGGCGTGC GGCAGGGCGA GGTCTTTCAC CCTTGGATCG CCTCCACCCA CCCAGACTAT
GGGCCTTGGT CTCCCGGGCA TCAGCTCTTC CCGCGTGCGA TCGCGGCCAT GCCCGCCCTG
GGCCTGACAA CCTACGACCT CGGCTGCGGT CACGATCACT ACAAGAGCGT CTACGCCTTG
CGGACCCGCA TCGTGACCGC GGGCCTGGCG ACCGCCGGCA ACCTGGCCGG CGACATCGCC
CGCTCCATCG ACGCCGCCTG GCTGCTGGCC GGCGCCGAAA GCCCCGGGCC GGTCGGCCGC
CTGCGGCGAC GGATGGACGC CATCGCCAAG GTCGACCTGA CCCTGTCCGG CCGCCTGCGG
GGCCTGGCCT TCGCCGTGGC CAGCCAGGAC CGTCGGCGCC GCGCCACCGA ACACGAGCAG
CCTTGA
 
Protein sequence
MLETRSVHPC ALDSQDVETW RAMASAHPEF RNPLLGPDFA QAVGALRPDA RVAIFRRHGR 
VLGYLPYHRR ACGQAWPIGA PLSDYHALVG VADAGLDGRD ALAAAGLSAF RFGGLVDPFE
VFGPGADQVG HVIAPADGPE AYLEQVRAAN PKKIKNYRRL GAKLERECGA VRLVADDRSR
PAFDQLIAWK REQLMRTGTH DFLGADWSLD LVTRLFEGQQ GELRGLMICL YAGDTLVAGH
FGVRQGEVFH PWIASTHPDY GPWSPGHQLF PRAIAAMPAL GLTTYDLGCG HDHYKSVYAL
RTRIVTAGLA TAGNLAGDIA RSIDAAWLLA GAESPGPVGR LRRRMDAIAK VDLTLSGRLR
GLAFAVASQD RRRRATEHEQ P