Gene Caul_4898 details

Gene Information

Locus tag: Caul_4898
Symbol:
ID: 5902360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5291250
End bp: 5292869
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 69%
IMG OID: 641565418
Product: ATP-dependent DNA ligase
Protein accession: YP_001686516
Protein GI: 167648853
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID:
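
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length_bp(5291250, 5292869)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 1620 539
```

Both values agree with the Gene Length and Protein Length fields of the record.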


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCT TCGCCCATCT TCTCGATCGA CTGTCCCTGA CGGCTTCGCG CAACGCCAAG 
CTCACCCTGA TCAAGGACTT CCTGCGCGAG ACGCCCGATC CCGAGCGCGG CTGGGTGCTG
GCGGCCCTGA CCGGAGCCCT GTCGTTCAGC GCCGCCAAGC CGGCCTTCAT CCGCAAGGCC
GTCGAGGCGC GAATGGACGC CCAGCTGTTC GCCTGGTCCT ACGACTATGT CGGGGACCTG
GCCGAAACCG TCGCCCTGGT CTGGCCGGCC AAGCCCGGCG CCAATCGCGA GCCAGACCTG
TCGGAGGTGG TCGAGGCCCT GCGCACCGCG TCGCGGACCG AGGTCCAGCG GCTGATCGAG
GGCTGGCTGG ACGCCCTGGA CGCCGACGGC CGCTGGGCCT TGCTGAAGCT GATGACGGGC
GGCCTGCGGG TCGGCGTGTC GGCGCGATTG GCCAAGCAGG CCTGCGCTGA CTTCGGCGGG
GTGGAGATCG GCGCGGTCGA GGAGGTCTGG CACGCCATGA CCCCGCCCTA TGGCGACCTG
TTCGCCTGGC TGGAGGGACG CTCCGAGCAG CCCTCGCCCG ATGCGCCGGG GCGGTTCCGG
CCGGTAATGC TGGCCCAGGC GATCGACGAG GTCGTGGACT TCGCCAAGCT CGATCCCGCC
GACTACGCCG CCGAGTGGAA GTGGGACGGC ATCCGCGTCC AGGCGGTCAG CGAACGCGGC
GAGCGGCGGC TCTATACCCG CACCGGCGAC GACATTTCGG CGACCTTTCC CGACGTGCTG
GAGGCCCTGA CCTTCGAAGG CGCGCTCGAC GGCGAGCTCT TGGTGATGCG CGATGGGCGG
GTCGCGAGCT TTGGCGACCT GCAGCAGCGG CTGAACCGCA AGACGGTCGA CGCCAAGCAA
CTGGCCAATT TTCCGATCGG TATTCGCGCC TATGATCTGC TGCTCGACGG CGAGGCCGAC
CTGCGCGGCC TGCCGTTCGT CGAGCGGCGC CAGCGCCTGG AGGCCTTCAT CGCCCAAATT
CCGACTGGAG GGGCCGCCAG CCCGCGCATC GACCTGTCGC CGATCCAGCC GTTCGCGACC
TGGGAGGCGC TGGCGGCCTT GCGCGCCGAG CCGCCGGCCG GTGATCCCAC GATCGCCGAA
GGTCTGATGC TCAAGCGCTG GGACAGCGTC TACGAACCGG GCCGGCCCAA GGGGCCATGG
TTCAAGTGGA AGCGCGATCC ACGCCTGATC GACGCCGTGC TGATGTACGC TCAGCGCGGC
CACGGCAAGC GATCGAGCTT CTACTCCGAC TACACCTTCG GGGTCTGGCG CGAGGACGAG
ACCGGGACAC GCCACCTGAC CCCGGTCGGC AAGGCCTATT TCGGCTTCAC CGACGAGGAG
CTGAAGCAGA TCGACAAATT CGTCCGCGAC CACACCGTCG AACGGTTTGG GCCGGTGCGC
TCCGTGCGGG CCGACTGGGA TTTCGGCCTG GTGTTCGAGG TCGCCTTCGA GGGGCTGCAG
CGCTCGACCC GCCACAAGTC CGGCGTGGCC ATGCGCTTTC CGCGCATCAA CCGCATCCGC
TGGGACAAGC CGTCGCGCGA GGCGGACGAG CTGAACACGC TGGAGCGGAT GCTGGATTGA
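
The GC content field can in principle be recomputed from the gene sequence. The sketch below is a minimal illustration: `gc_percent` is a hypothetical helper that strips the whitespace used to group the sequence into blocks of ten and counts G and C bases. Only the first row of the sequence is used here to keep the example short, so the printed value describes that row alone and is not expected to match the whole-gene figure in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases), used only as sample input.
first_row = "ATGCGCGCCT TCGCCCATCT TCTCGATCGA CTGTCCCTGA CGGCTTCGCG CAACGCCAAG"
print(round(gc_percent(first_row), 1))
```

To check the record's value, pass the full 1620 bp sequence to the same function.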
 
Protein sequence
MRAFAHLLDR LSLTASRNAK LTLIKDFLRE TPDPERGWVL AALTGALSFS AAKPAFIRKA 
VEARMDAQLF AWSYDYVGDL AETVALVWPA KPGANREPDL SEVVEALRTA SRTEVQRLIE
GWLDALDADG RWALLKLMTG GLRVGVSARL AKQACADFGG VEIGAVEEVW HAMTPPYGDL
FAWLEGRSEQ PSPDAPGRFR PVMLAQAIDE VVDFAKLDPA DYAAEWKWDG IRVQAVSERG
ERRLYTRTGD DISATFPDVL EALTFEGALD GELLVMRDGR VASFGDLQQR LNRKTVDAKQ
LANFPIGIRA YDLLLDGEAD LRGLPFVERR QRLEAFIAQI PTGGAASPRI DLSPIQPFAT
WEALAALRAE PPAGDPTIAE GLMLKRWDSV YEPGRPKGPW FKWKRDPRLI DAVLMYAQRG
HGKRSSFYSD YTFGVWREDE TGTRHLTPVG KAYFGFTDEE LKQIDKFVRD HTVERFGPVR
SVRADWDFGL VFEVAFEGLQ RSTRHKSGVA MRFPRINRIR WDKPSREADE LNTLERMLD
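
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below shows one way to reproduce that translation, under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter for a gene beginning with ATG), and it stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names, not part of any library.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGCGCCT TCGCCCATCT TCTCGATCGA"))  # MRAFAHLLDR
```

The output matches the first ten residues of the protein sequence in the record. The final three codons of the gene, CTG GAT TGA, give L, D, and a stop, consistent with the protein ending in LD.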