Gene Caul_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1812 
Symbol 
ID5899267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1921243 
End bp1923564 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content73% 
IMG OID641562302 
Producthypothetical protein 
Protein accessionYP_001683439 
Protein GI167645776 
COG category 
COG ID 
TIGRFAM ID[TIGR02243] conserved hypothetical protein, phage tail-like region 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.439098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC GGGTCGAGAG CGCCACCCTG GGCGGCGACG GGGTCCTGAC GGTGGTCATC 
GTGACCCCTT CGGCGACCGA ACAAGCCACC CCGGTGGTCA CCGTCCGCGA CGTATCCGGC
CGGATCCTGC CCCTGGCGCC GATCGCGTCC GCGCCCGCCA ACCCGGCGGA TGATCTGGTC
GCGCCCTCCA ACCTCAAGAC CGTCAGGGTC ATGGCCCAGC CGCCCCATCC CCTGTCGGAT
CCGCCCTCCT ACCTCCTGAC AGTCGATCCG CCCGGTGACA GCGCCTGGAT CGCGCCGGCC
ACGCCGCCGG CCGCCGCGCC CACGGCGGGG ACCACCTCCT CGGCGGCCGC CTCGACCGGC
TCGACCGGCT CGACCAATAT CGAGTACACC GCGCGCGACT TCACCGGCCT GCGCACCATG
ATGCAATCGC GGCTGGCCCA GAACCTTCAA GGCGACTCCG CCTGGGCGCT GGACCATCCG
GCCGATCCGC TGATGACCCT GGCCGAGATC CTGGCGGCGC GCGGCGACTA CCTGTCCTAC
CAGCAGGACG CGGTGGGCAC GGAAGCCTAC CTGTCGACCG CCCGCCGCCG GCTGTCGGTC
CGCCGCCTGG CCCGGCTGCT CGACTATGGG ATCAACGACG GCTGCAACGC CCGCACCTGG
CTGGCGTTCG GCGTCGCCCA GGACGTCATC TTGCCGGCGG GGATGGCCGT GGTCACGCCG
CAGCCCGGAC GGCTGGGCGT CGTGCTGCCG CCCGGCCCGC TGGGTCCGGG AACCACGGTG
TTCGAGACCA TGCAGCCGCT GACCGCCCTG ACCGCCCTCA ACGACCTGGG CCGGTGGCTG
GTCACGTCCG CCAATCATGT GATCCCGGCC CAGACCTGCA GCCTGGTCCT GCCCGGCCAG
TTCCCAGGCC TCGCCGCCGG GCGGGTGCTG GTGTTCGAGC AGATCATCTC GCCGGACAGC
TCCGCCCCGT TCGGCGCCCA GGCGGTGCGG CTGACCTCGG TGACCTACGC CCCCGCAAGC
GGCGGAGGCG GCGGCGGCTC GACCACGATC ATCTGGCACC CGCGGGACGC CCTGGCCAGG
GACCTGACCG TGCCGGCCTC GGGCGCCGGC CTGCCCTCGC TCTATGGCAA TGTCGTCCTG
GCCGACCACG GCCAGACCAC CACCGCCGCC CTGGTCCTGA ACGACGAGGG CCTGCCCGCC
TCGGTCACGG GCGCCGATCC CGTCTTCGCC GCCCCGCCGC CCGCCACCAA CGCCGGATGG
GACGGCGCCG ACCCGGACGC GAGCCTGGCG GACGTTCCCT CCGCCGCCGC CTGTCTGATC
CAGGATCCGA GCCTGGCGGT CTGCCAAGTC ACGCTCAACG ACGGCACGCG CGACTGGCGG
CCGGTCCGCG ACCTGCTGCG CGCGCCATCC GACGCCCGTC TGTTCGCTGT CGAGCCCGAG
GACCCCGCGC CCACCACCGG CCAGCGGCGG CTGCTGATCC GCTTCGGCGA CGGCGTCCTG
GGCCGCCCGG CCCCCGCTGG AGCCACGTTC TCGGCCAAGG TGCGCGAGGG CCACGGCAAG
ACCGGCCGCG TCAAGCCGCG CACCCTGGTT CAGATCCCGC CCTCATCGGT TTCCGGCGTC
ATCCGCACCG TGCTCAATCC GCTGGCGGCC GCGCCGACCC CGCCGGAGTC GGCGGCCGCC
GCCCGGCTGT TCGCGGCCAC CGCCTTCCGC GTCCAGCGAC GCGGGGTCAC CCCCGGCGAC
TGGGAGATGC TGGCGCGCGA GCACCCCCTG GTCACCGAGG TCGGGGCCAC CGCGCCGACC
GGCGACGAGC AGGGTTGCCA TGTCGCGATC GAGACCCTGG CGCCGGCCCA GGCCACCTTC
GACATCGTCA GGGACGACCT GTTGGCCTAT GCGCTGATCG GCGCGCGCCC GACCATCACG
CCGCTGACCG TCGTGCCGCT GAACATTGTC ATGGCCGTCT ATTGCGACCC CGACGCCGAC
ATCGCCAGTC TGCGCAGGGA CTTGGCCCAG GCGCTGGGCG TCGGCCTGCT GCCCGACGGC
GGCCCCGCCT TCTTCAATCC CGCGCGCCTC ACGCCGGGGC GCTCGATTCC GCTGGACGAC
GTGGTCTCCG TGATCCTGGC CCAGGACGGC GTCAGCTGGG TCAATCTCAA TCCCGACACC
GACCCCCGCA TCCGCTTTGG CCGCCTGGAC GACCCAGGCA GCGGCAAGCG CGGCTTCACG
ACCGGGTCCA TTCCGGTCAA GCCCACCGAA CGGGCCCAGG TTTCCGCCGA CAACCGGCGC
CCCGAGGATG GCGGGGTCAG CCTCTATGTG ATCGTCGCGT GA
 
Protein sequence
MSLRVESATL GGDGVLTVVI VTPSATEQAT PVVTVRDVSG RILPLAPIAS APANPADDLV 
APSNLKTVRV MAQPPHPLSD PPSYLLTVDP PGDSAWIAPA TPPAAAPTAG TTSSAAASTG
STGSTNIEYT ARDFTGLRTM MQSRLAQNLQ GDSAWALDHP ADPLMTLAEI LAARGDYLSY
QQDAVGTEAY LSTARRRLSV RRLARLLDYG INDGCNARTW LAFGVAQDVI LPAGMAVVTP
QPGRLGVVLP PGPLGPGTTV FETMQPLTAL TALNDLGRWL VTSANHVIPA QTCSLVLPGQ
FPGLAAGRVL VFEQIISPDS SAPFGAQAVR LTSVTYAPAS GGGGGGSTTI IWHPRDALAR
DLTVPASGAG LPSLYGNVVL ADHGQTTTAA LVLNDEGLPA SVTGADPVFA APPPATNAGW
DGADPDASLA DVPSAAACLI QDPSLAVCQV TLNDGTRDWR PVRDLLRAPS DARLFAVEPE
DPAPTTGQRR LLIRFGDGVL GRPAPAGATF SAKVREGHGK TGRVKPRTLV QIPPSSVSGV
IRTVLNPLAA APTPPESAAA ARLFAATAFR VQRRGVTPGD WEMLAREHPL VTEVGATAPT
GDEQGCHVAI ETLAPAQATF DIVRDDLLAY ALIGARPTIT PLTVVPLNIV MAVYCDPDAD
IASLRRDLAQ ALGVGLLPDG GPAFFNPARL TPGRSIPLDD VVSVILAQDG VSWVNLNPDT
DPRIRFGRLD DPGSGKRGFT TGSIPVKPTE RAQVSADNRR PEDGGVSLYV IVA