Gene Caul_3996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3996 
Symbol 
ID5901458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4325933 
End bp4326889 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content71% 
IMG OID641564517 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001685619 
Protein GI167647956 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.729024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCGA TGGATAGGAG CAAGTCTAGC ACGACGATCG CCGGGATCGA TGTGGGCAAA 
CGCCAGCTGG ATGCGGCGGT GCACGGTCGC GCCGAGCAGC ACCAGGTGGC CAACGACCCG
GCCGGCTGGG AGGCGCTGGT CGCCTGGCTG GGCGATCGGG GCGTGGCGCG GGTGGGGCTG
GAGGCGCCGG CCGGCTACGA GCGCGGTGTG CGGGCCAGGC TGGAGGCGGC GGGCCTGGAG
GTGGTGGTGC ATCAGCCGCT GGAGGTGAAG CTGTTCGCGC GGCTCAAGCG GCGGCGGGCC
AAGACCGACC GGCTGGACGC GGCCCTGATC GCGGCGGCGA CGGCGCAGGT CGATACGGTG
CGGGCGGCCC AGGATCCGCG CCTGGCCGAG TTGGCCGAAC GCCTGACGGC CTACGAGCAG
ATCACCGATC AGGCCGCCCA GTTGAAAACC TTCATGGAGC ACGTGGCTCT GCCCGACCTT
GTGGCCAGCC TGGGCGAACA GATCCAAAGC CTGTCGCGAC TGAAAGCCCG CCTCGCGCGC
GAGATCCTGG CGCGGCTCAA GGCTTGGCCC GACCTGCTGG CCCGTTGGCG CCTGCTGCAG
TCGTTGCCGG GCGTGGGACC ACTGGTGGCC GCCAGCCTCG TCGTGCGCAT GCCCGAGCTG
GGCGTCCTGC GGCGCGGCCA GCCCGCAGCC TTGCTCGGCG TGGCTCCCTT CGACCGCCAG
TCCGGCCAAT GGCGAGGCCA GAGCTTCATT GGGGGCGGCC GAAGCCGACC GCGCCGCATG
CTCTATCTCG CCGCCATCGC CGCCAAGCGC TTCGATCCCG GCTTCAAGGC CTTCGCCGAG
CGCCTCCTGA GCGCCGGCAA GCCCCCCAAG GTCGCCATCG TCGCCGTCAT GCGAAAACTC
ATCGAGGCCG CCAACCTCAT CCTCGCACGC CAGCAACCCT GGGTCCGACA CCCCTGA
 
Protein sequence
MGSMDRSKSS TTIAGIDVGK RQLDAAVHGR AEQHQVANDP AGWEALVAWL GDRGVARVGL 
EAPAGYERGV RARLEAAGLE VVVHQPLEVK LFARLKRRRA KTDRLDAALI AAATAQVDTV
RAAQDPRLAE LAERLTAYEQ ITDQAAQLKT FMEHVALPDL VASLGEQIQS LSRLKARLAR
EILARLKAWP DLLARWRLLQ SLPGVGPLVA ASLVVRMPEL GVLRRGQPAA LLGVAPFDRQ
SGQWRGQSFI GGGRSRPRRM LYLAAIAAKR FDPGFKAFAE RLLSAGKPPK VAIVAVMRKL
IEAANLILAR QQPWVRHP