Gene Caul_2255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2255 
Symbol 
ID5899710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2448991 
End bp2450499 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID641562746 
Productintegrase catalytic region 
Protein accessionYP_001683880 
Protein GI167646217 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.659658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGAGA CTGTGGTGCG GATTCGCCGC GAGTATGCGG CGGGCAAAGC GATCAAGGCG 
ATCTGCCGGG ATTTGAGACT GTCGCGGAAG GTCGTGCGCA AGGCGATCCG GGCAGAGGAA
GGCGCGTTCA GCTATCAGCG GACGACGCAG CCGTTTCCGA AGATCGGGCC GGTGCGTGAT
CGGCTTGTGC AACTTCTGAC GGAGAACGAG GCGCGGCCCC GGCGGGACCG TTTGCGACTG
ACGCGGGTCT GGGACCTGTT GGTCCAGGAA GGCTACGACG GCTCCTATGA CTCGGTCCGT
CGTTACGCGG CCCGCTGGCG CGAGGAGACC AAGACGGCCC CAGGCGATGG CGGGACAGCG
TTCGTGCCGC TGATGTTCGC GCCGGGCGAG GCGTTCCAGT TCGACTTCAG TCATGAGGAC
GTGGAGGTCG CCGGCCAGCC GATGCGGGTG AAGGCGGCCC ATGTTCGGCT TTGCGCGTCG
CGTGCGGTCT ACGTCAGGGT TTATCCGCGT GAGACCCAGG AGATGGTGTT CGACGCCCAT
GCCCGGGCGT TCGCCTTCTT CGGCGGGGTC CCGACGCGGG GCATCTACGA CAACATGAAG
ACCGCCGTTG ACGCGGTGTT CTTGGGCAAA CAGCGGGTCT TCAACCGTCG CTTCCTGCTG
ATGGCCGATC ATTACATGTT CGAACCGACT GCTTGCACGC CCGCCGCGGG CTGGGAGAAG
GGCCAGGTCG AGAACCAGGT GCAGACCAGC CGGGAGCGGT TCTTCAAACC GCGCTTGCGG
TTCGCCAGCC TCGAAGAGCT GAACGGCTGG CTGGAGGCCG AGTGCCGCCG CTGGGCGCGC
CTACACCCCC ATCCCGAGCA GCGCGAGATC ACCTTGGCCC AGGCCTTGGA GGCCGAGCGG
CCGGCGCTGC AGGCGATCCT GGCGCCGTTC GACGGCTTCC ACGAGGTCGA GCACGCGGTG
ACCGGCACCT GCCTGATCAC CTTCGACCGC AACCGCTACT CGGTGATGGC CAAGGCCGCC
AAGCGCACGG TGCAGGTGCG CGCTTATGCC GACAAGATCG TCGTACGCTG CGCCGGCGAA
GTCGTCGCCG AGCATGCCCG GTCCTTCGGC CGGGGCCGGA CGATCTATGA TCCCTGGCAC
TACCTGCCGG TCCTGGCCCG CAAGCCGGGC GCTCTACGCA ACGGCGCGCC GTTCCAGGAT
TGGTCCCTGC CGCCGGCCCT GACGCGGCTC AGCAAGAAGC TGGGGCGTGG CGACGAGGCC
GACCGCCGGT TCGTTCGCGT GCTGGCGGCG GTGCTGATCG ATGGCCTGGA TGTGGTCGAT
GACGCCGTCC GCGAGGCCCT GGACGCCGGC GCGGCTAGCG ATGAGGTCAT CCTCAACATC
CTGGCCCGGC GACGCGAGCC GCCAGCGCCA CAGCCGATCA CCACCTGCGA GGCGCTGGTC
CTGCGCCATC CGCCCATCGC AGACTGCGCC CGCTACGACC TGCTGCGAGG CCCCCGTGCA
GCGGCATGA
 
Protein sequence
MVETVVRIRR EYAAGKAIKA ICRDLRLSRK VVRKAIRAEE GAFSYQRTTQ PFPKIGPVRD 
RLVQLLTENE ARPRRDRLRL TRVWDLLVQE GYDGSYDSVR RYAARWREET KTAPGDGGTA
FVPLMFAPGE AFQFDFSHED VEVAGQPMRV KAAHVRLCAS RAVYVRVYPR ETQEMVFDAH
ARAFAFFGGV PTRGIYDNMK TAVDAVFLGK QRVFNRRFLL MADHYMFEPT ACTPAAGWEK
GQVENQVQTS RERFFKPRLR FASLEELNGW LEAECRRWAR LHPHPEQREI TLAQALEAER
PALQAILAPF DGFHEVEHAV TGTCLITFDR NRYSVMAKAA KRTVQVRAYA DKIVVRCAGE
VVAEHARSFG RGRTIYDPWH YLPVLARKPG ALRNGAPFQD WSLPPALTRL SKKLGRGDEA
DRRFVRVLAA VLIDGLDVVD DAVREALDAG AASDEVILNI LARRREPPAP QPITTCEALV
LRHPPIADCA RYDLLRGPRA AA