Gene Caul_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3801 
Symbol 
ID5901263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4115786 
End bp4117834 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content63% 
IMG OID641564323 
ProductSMC domain-containing protein 
Protein accessionYP_001685425 
Protein GI167647762 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0420971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000733932 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGCTCG ATTTCATCGA AATATGTGGC TTTCGCGGGT TCAGGGATCT CGTCCGAATT 
AACTTCGGTC GTGGCTTCAC CGTCATCACC GGACGCAACG GCGTGGGCAA GAGCACCCTA
TGCGATGCTG TCGAATTCGC GATCATCGGC TCCATCGACA AATACGCCGT CGAAAAAGCC
GCGAAGGAGA GCCTCAGCGA TTATCTCTGG TGGCGGGGCG AAGGTGTGCC CAAAGCCCAT
TACGTTATAG CCTCGTTTAT CGACGATGAC GGAAAGCCAT TCACCATCAC GCGCACTCGG
GAGTCCGGCT CCGATCGTTC GCCAGAGGAG ATCCAAGCTG CGCTTTGCCG GGGGCCTGCG
CCGGACGACG CGCTTCGTCA GCTCACCCGG ACGTCGATCA TCCGCGACGA ATGGATCGCG
GCGCTCAGCC TCGACCTAAC CGAAACGGAG AGGTTCGATC TCGTCCGATC TGCACTCGGC
GCAGTGGAAG GCTCCGAAGC GGGCAGTAGA GCGAAGGAAG TCGTGGGCGC CGCCGAAGCT
GCCCACTCGA AGGATGAAGC CGCCTATGAT GCTGCTCGGA CGCGGCTGGC GGACAGGCTT
ACGCAACAGT CGGAGACGCA AGCAGCCCTG AGCCGATCGG GCGACGTGTC CGCCGCACTC
AATGTGATCG CTGCAGCGGC GCCAGCTGCG CCCCCCGAGC TGACTGCTCG GCTATCCGCA
GGCAGAAATG CGTTGGCCAA CGCGCGCGCC CGGTTGGCGC GGATGGGGGA AGCGCTTCAA
CTCGGTCGTG AAGTCGCGGC GACTCAGGCG GCCTTCAACG CTCCAGAGGC GCTTGCCGGC
CGCGAGGCCG CATCGGCTGC GCATGAGACT GCCCAGCGCG AGCACGCGGC CGCCCAGAAG
ACCGTTTCGG ATGCCGAAGA GCACCTCGGC CGTGAGGAAG AGATTGACGC GATCGCGGCC
TCTCTGAGCA TTCTCGTGGA GCACGGTGAG CGCCTCGGGT TGCACGACGA CCATTGCCCA
CTTTGCGCGG CCCATCGGAC GTCGGATGAA TTCGCAGCAG GTCTAGCCGC GGCCCGGCAT
CGGATTACTT CGCTTGCGTC GGGTGTTCAA GGAGCCCGCG ATGCTCTCGC GGCCGCCAAA
GAGAACGCCC GGCAACGCAA CCTCGCCCTC AAGGCGGCGG CCGCTGAAGT CGAAGCTCAC
GCAGACGAGT TGCGCCGGTT GCGCGAGCAG GAAGCCGAGC ATGTCGATTT CTACACGCAG
TGGGGTCTCG ATCACCGCTT CATTCAAGAC CCTGAGGGGC TGGAGCAGAC CATCTCCGTC
GAACGCGACC GTTTGATAGA TCTTGAGCGC GCACTCCTGG TGCTGGAGGC TTCGCAAGCT
GTTTCGCGGA TGTCGTCGAT TGAAAGCAAC ATCACGGCGC TGCGGGCCGA TATCGAAAAG
CTTGCCAACG CAGTCAGTCA GTCGCAAAAC GCCGTCACAG CGGCCCGTGA GATCGAACGC
TCGGTCAGGC GTGTGAGCGC CGAAATCATT GATGAGCGTC TTGCTCAGAT CAGTCCACTC
CTCAATGAAC TGTACCAGCG CCTACGACCG CATGCCGACT GGCGGACAAT CGATTACAGC
ATTCGAGGCG ACGTGCGGCG TTTCCTCAGC CTCAAGGTCG GGGACGGCCT GAATCCGCAA
TTTGTCTTCA GCAGCGGCCA GCGTCGCGCG GCAGGTCTCG CCTTCCTTCT CTCGGTCCAT
CTTGCCCGGG CGTGGACACC ACTCAGGTCC CTTCTGCTCG ATGACCCCGT TCAGCACATC
GACGATTTCA GAGCCCTTCA CCTAGTCGAA GTGCTCGCCG CGCTGCGCTT GGACGGGCGC
CAGATCATCT GCGCCGTCGA AGACCCAGCG CTGGCCGATC TGCTGTGCCG GCGCTTGGTA
AGCACCGCGA CGGAGGGAGG CCGGCGTCTC GATATCGACC TTGGATCGCT CGGCGCCACA
AGCGTGGTCA TTGAGCAGGA AGTCCATCCG ATGCCGGTCG GCGTTCTACG TGGTGTCGCC
ACTGCATGA
 
Protein sequence
MRLDFIEICG FRGFRDLVRI NFGRGFTVIT GRNGVGKSTL CDAVEFAIIG SIDKYAVEKA 
AKESLSDYLW WRGEGVPKAH YVIASFIDDD GKPFTITRTR ESGSDRSPEE IQAALCRGPA
PDDALRQLTR TSIIRDEWIA ALSLDLTETE RFDLVRSALG AVEGSEAGSR AKEVVGAAEA
AHSKDEAAYD AARTRLADRL TQQSETQAAL SRSGDVSAAL NVIAAAAPAA PPELTARLSA
GRNALANARA RLARMGEALQ LGREVAATQA AFNAPEALAG REAASAAHET AQREHAAAQK
TVSDAEEHLG REEEIDAIAA SLSILVEHGE RLGLHDDHCP LCAAHRTSDE FAAGLAAARH
RITSLASGVQ GARDALAAAK ENARQRNLAL KAAAAEVEAH ADELRRLREQ EAEHVDFYTQ
WGLDHRFIQD PEGLEQTISV ERDRLIDLER ALLVLEASQA VSRMSSIESN ITALRADIEK
LANAVSQSQN AVTAAREIER SVRRVSAEII DERLAQISPL LNELYQRLRP HADWRTIDYS
IRGDVRRFLS LKVGDGLNPQ FVFSSGQRRA AGLAFLLSVH LARAWTPLRS LLLDDPVQHI
DDFRALHLVE VLAALRLDGR QIICAVEDPA LADLLCRRLV STATEGGRRL DIDLGSLGAT
SVVIEQEVHP MPVGVLRGVA TA