Gene Caul_4157 details


Gene Information

Locus tag: Caul_4157
Symbol:
ID: 5901619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4507044
End bp: 4508342
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 65%
IMG OID: 641564678
Product: integrase family protein
Protein accession: YP_001685779
Protein GI: 167648116
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
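
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4507044, 4508342)
print(length, protein_length_aa(length))  # 1299 432
```

Both values agree with the Gene Length and Protein Length fields listed in the record.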


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.225884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.636127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGCTGT CCATTCGAAC CGTCGAAGGC CTGGAGCCGA GCGACACTGA CTACTTCGTC 
TGGGATGACG AACTACCGGG CTTTGGGCTG CGAGTGTACG CCTCGGGCCG CAAGACCTAC
CTCGTCCAGT ACCGGTCGAG CCGGCGCACC CGGCGCCTGA CGCTCGGCAT CCACGGCGCC
CTTACGCCCG AGATGGCTAG GCGCGAGGCC AAGATTCATC TCGGTGGCAT CGCCAAGGGC
GGTGACCCGG CCGAGGAGAA GAAGACCAAC CGCAACGGCA TGACCGTTCG CGAGCTCTGC
GAACGATACA TGGAGGACGC TGAGATCGGC CTGGTCCCCG GCAAGCGGCG GATGCCGAAG
AAGCCGTCCA CCCTCCAGGC TGATCGCAGC CGCATCAGCT CCCACATTAC GCCGTTGCTC
GGCACGCGCC TGGTCAAGGA GGTCACGCGC GCCGACATCC ATCAGTTCAT GCGCGACGTC
GCCCTCGGCA AGGCGCGCCG CGATCGCAAG ACCAAGCTGA GGGGCCGTTC GATCGTGCGC
GGCGGTCTGG GCACGGCGGG GCGCACCCTG GGTCTGCTCG GCGGCATCTT TACCTACGCT
CAGCGGATCG GCGTGATCGA GAACAACCCC GTTCGAGGCA TCCCCAAGCC GGCGGTGAAC
AGTCGCTATC GACGTCTCAG CGCAGCTGAA TATCGGACGC TGGGCGAAAC GCTGCGCGAG
GCCGAGCAAG AAGGGCACAA CGAGAAGGCC CTGACGATCA TCAAGTTGCT GGCGCTGACA
GGGTGTCGGC GGGGCGAGAT TGAGAAGCTG CGTTGGGAGG AGGTCGACGA CGCCGGCCGC
TGTTTCCGGC TGGCGGACAC CAAGGAGGGG CGCTCGATCC GTCCGATCGG CCCGGAGGTC
TTTGCCTTGC TCAATCATCG GCGGCCTGCC GACGCCAGGG GTTACGTCTT TGAAGGCGAC
GTCGCCGGCA AGGCCTTCGA CGGCGTGCCC AAGGTGTGGG GCAAGACGAT CCGCAAGCAG
CTGATGGATG TGACGCCACA CGTTTTGCGG CACAGCTTCG CCAGCATGGC GAACGATCTC
GGCTTTACCG AAGCGACGAT CGCCGCGCTC CTTGGCCATG CGGCCGGCAC CACGACCAGC
CGCTACGTCC ACAACCTCGA CACCGCCTTG ATCTCGGCCG CGGAGAAGGT TTCAGGGCAT
ATCGCCGGCC TGCTACGCCA TGCCGACGAG CGCCAAGGTC TCAAGTCGCT TTTCGAAAGA
TTCGGTGAAG CCACGGGCTC GTGGGGCAAG GCCGCGTAG
 
Protein sequence
MRLSIRTVEG LEPSDTDYFV WDDELPGFGL RVYASGRKTY LVQYRSSRRT RRLTLGIHGA 
LTPEMARREA KIHLGGIAKG GDPAEEKKTN RNGMTVRELC ERYMEDAEIG LVPGKRRMPK
KPSTLQADRS RISSHITPLL GTRLVKEVTR ADIHQFMRDV ALGKARRDRK TKLRGRSIVR
GGLGTAGRTL GLLGGIFTYA QRIGVIENNP VRGIPKPAVN SRYRRLSAAE YRTLGETLRE
AEQEGHNEKA LTIIKLLALT GCRRGEIEKL RWEEVDDAGR CFRLADTKEG RSIRPIGPEV
FALLNHRRPA DARGYVFEGD VAGKAFDGVP KVWGKTIRKQ LMDVTPHVLR HSFASMANDL
GFTEATIAAL LGHAAGTTTS RYVHNLDTAL ISAAEKVSGH IAGLLRHADE RQGLKSLFER
FGEATGSWGK AA
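
The protein sequence follows from the gene sequence by translation with table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the pipeline that produced this record. It assumes the sequence is given on the coding strand, uses the standard codon assignments (which table 11 shares), and treats ATG, GTG and TTG as initiator codons read as methionine when they occur at the first position. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a commonly used subset of table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above (six complete codons).
print(translate_cds("GTGAGGCTGT CCATTCGAAC"))  # MRLSIR
```

The leading GTG codon is why the protein begins with M rather than V. Passing the full gene sequence to the same function should reproduce the 432-residue protein listed above.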