Gene Caul_1392 details

Gene Information

Locus tag: Caul_1392
Symbol: recA
ID: 5898847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1480838
End bp: 1481908
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 64%
IMG OID: 641561879
Product: recombinase A
Protein accession: YP_001683020
Protein GI: 167645357
COG category: [L] Replication, recombination and repair
COG ID: [COG0468] RecA/RadA recombinase
TIGRFAM ID: [TIGR02012] protein RecA
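
The lengths listed above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length_bp = gene_length(1480838, 1481908)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```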


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.344472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC AGGCGGCTTT GAAACTCGTG GGCAAAGAAG ACGGCGACAA GCAGCGCGCG 
CTCGAAGCGG CGCTGGCGCA GATCGACCGG GCGTTCGGCA AGGGCTCGGT GATGAAGCTG
GGCGAAAAGG GCAAGGTCGA GATGGAGTCG ATCTCCACCG GCTCGCTCGG CCTGGACATC
GCCCTGGGCA TCGGCGGCCT GCCCAAGGGG CGGGTGATCG AGATCTACGG TCCGGAAAGC
TCGGGCAAGA CCACCCTGGC CCTGCACGTG GTGGCGGAAT GTCAGAAGGC CGGCGGCACG
GCCGCCTTCG TCGACGCCGA GCACGCCCTG GATCCGGGCT ATGCCTTCAA GCTGGGCGTC
AACCTCGACA ACCTGCTGGT CTCGCAGCCC GACAACGGCG AACAGGCCCT CGAGATCACC
GACACCCTGG TGCGCTCGGG CGCCGTGGAT ATCGTGGTCA TCGACTCGGT CGCGGCCCTC
ACGCCGAAGG CGGAAATCGA AGGCGAGATG GGCGACAGCC TGCCGGGCCT GCAAGCCCGC
CTGATGAGCC AGGCGCTGCG CAAGCTGACC GCCTCGATCA ACAAGGCCAA CACCATCGTC
ATCTTCATCA ACCAGATCCG TCACAAGATC GGGGTGATGT ACGGCAGCCC GGAAACCACC
ACCGGCGGCA ACGCCCTGAA GTTCTACGCT TCGGTCCGCC TGGATATCCG CCGCACCGGT
TCGATCAAGA ACCGCGACGA GATCGTCGGC AACAACGTCC GGGTCAAGGT GGTCAAGAAC
AAGGTGGCCC CGCCGTTCCG CGAGGTCGAG TTCGATATCA TGTATGGCGA GGGCATCTCC
AAGCTGGGCG AGATCATCGA TCTGGGCGTC AAGGCCGGGA TCATCGACAA GGCCGGCTCG
TGGTTCTCCT ACAACAGCCA GCGCATCGGT CAGGGCCGCG ACAATGTTCG TGAGTTCCTG
AAGGTCAACA AGGATCTGGC CGCCGAGATC GAGGCCGCCG TGCGCAAGTC CTCCCAGAAG
ATCGAGGAAG AACTGCTGGT CGGCGGCCCT GAGGACGGCG ACGACGAATA G
 
Protein sequence
MSNQAALKLV GKEDGDKQRA LEAALAQIDR AFGKGSVMKL GEKGKVEMES ISTGSLGLDI 
ALGIGGLPKG RVIEIYGPES SGKTTLALHV VAECQKAGGT AAFVDAEHAL DPGYAFKLGV
NLDNLLVSQP DNGEQALEIT DTLVRSGAVD IVVIDSVAAL TPKAEIEGEM GDSLPGLQAR
LMSQALRKLT ASINKANTIV IFINQIRHKI GVMYGSPETT TGGNALKFYA SVRLDIRRTG
SIKNRDEIVG NNVRVKVVKN KVAPPFREVE FDIMYGEGIS KLGEIIDLGV KAGIIDKAGS
WFSYNSQRIG QGRDNVREFL KVNKDLAAEI EAAVRKSSQK IEEELLVGGP EDGDDE
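
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch in plain Python, not the database's own tooling. It relies on translation table 11 assigning the same amino acid to every codon as the standard genetic code (the two tables differ only in permitted start codons). The helper names `translate` and `gc_fraction` are illustrative. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; any trailing partial codon is ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCAATC AGGCGGCTTT GAAACTCGTG GGCAAAGAAG ACGGCGACAA GCAGCGCGCG"
last_line = "ATCGAGGAAG AACTGCTGGT CGGCGGCCCT GAGGACGGCG ACGACGAATA G"

print(translate(first_line))  # MSNQAALKLVGKEDGDKQRA
print(translate(last_line))   # IEEELLVGGPEDGDDE*
```

The two outputs match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon. Calling `gc_fraction` on the full 1071 bp sequence would give the value to compare with the reported GC content.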