Gene Caul_5446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5446 
Symbol 
ID5897118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp160651 
End bp161682 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID641550733 
Productintegrase catalytic region 
Protein accessionYP_001672219 
Protein GI167621711 
COG category[L] Replication, recombination and repair 
COG ID[COG2826] Transposase and inactivated derivatives, IS30 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCGC GCTACGATCA TATCACGCTT GAAGAACGCC ACCTGATCTG TCGTTGGCGT 
GACGCCAAGG TGCCTGTGCG GGTCATCTCC GAGCGCCTCG GCCGCCATCC ATCGACTGTC
CATCGCGAGA TCCGACGCAA TTGGTTTGAC GATGGGCCCT GGCTGCGCGG CTACTTCGCC
ATCGCCGCCG ACCAGCGCGC CTCGTCGCGA CGCAGGCGTG TTGGCAAGCT GCATCGCGAT
CCTGAACTGG CCCACTTCGT CACACAGCGC TTGCGAGAGA CCTGGTCGCC AGAACAGATC
GCCGGTCATC TCAAAGCGAC CCGGCAGCTC CAAGCCTATG CCTGCCACGA GACCATCTAC
CGCTACGTTT ACGGCCCAGA TGGCCGGGCC GCGGAGCTCT ACAAGCTGCT CCCCAGGATG
CGTCGGCGCC GGCGCGCGCG CTATGCTCGA AGGCCGCGCG GCGGCCTGCA TATTCCGCTA
CAGAACACCA TTGCACAGCG CCCCGCCCAC ATCGGCGAAC GCCAGGGTTA CGGCCATTGG
GAGTGCGATC TGATCGCTTT CCGGCAGGAA TACGGGCGTC ACAACATCAC GACGTTGGTC
GAGCGTCGAA GCCGGTATCT GATCATGATC AAAAACCCGA GCCGCAGCTC GACGGGCATC
ATGGCCGGCT TGGCCGAGCG TCTGGAGCCG CTTCCTCCGC CGATGCGGCA ATCCATTACC
TTTGATCGCG GCACTGAGTT TGCTTTCTTT GCGACCTTGA AGCGCTCGCT GGAGATCGAG
AGCTACTTCT GCAAGCCACA GGCGCCTTGG CAAAAGGGTA CTGTCGAGAA CACCAATGGC
CGCCTGCGAC GCTTCCTACC CAGCGACACG GACATCGCGT TGATCCCGCC AGAGAAGCTG
CTGGAGCTGA CCACGCGGCT CAATCGGATC CCGCGCAAGT GTCTTGGATA TCGCACGCCG
GAGGATGTGC TGGGTGAGCA GATAGCGGCT ACGGCTGGGA CACGGGCGAC GAACGGGCTC
TGCGCTACGT GA
 
Protein sequence
MGSRYDHITL EERHLICRWR DAKVPVRVIS ERLGRHPSTV HREIRRNWFD DGPWLRGYFA 
IAADQRASSR RRRVGKLHRD PELAHFVTQR LRETWSPEQI AGHLKATRQL QAYACHETIY
RYVYGPDGRA AELYKLLPRM RRRRRARYAR RPRGGLHIPL QNTIAQRPAH IGERQGYGHW
ECDLIAFRQE YGRHNITTLV ERRSRYLIMI KNPSRSSTGI MAGLAERLEP LPPPMRQSIT
FDRGTEFAFF ATLKRSLEIE SYFCKPQAPW QKGTVENTNG RLRRFLPSDT DIALIPPEKL
LELTTRLNRI PRKCLGYRTP EDVLGEQIAA TAGTRATNGL CAT