Gene Dshi_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1643 
SymbolrecA 
ID5713208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1704693 
End bp1705760 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content59% 
IMG OID641267559 
Productrecombinase A 
Protein accessionYP_001532986 
Protein GI159044192 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.883064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000602005 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCAGCG CGAGTCTTTT GGACGTGACC AACAATCGTA ACGCAGACAA GCAGAAAGCT 
CTCGACAGCG CCTTGGCGCA GATCGAGCGG CAATTCGGCA AGGGCTCCAT CATGAAGCTC
GGCGCCGACA GTCCCGTGGC TGAGATCGAA GCGACCTCGA CCGGGTCGAT CGGATTGGAT
ATCGCGCTCG GAATCGGGGG TATTCCCAAG GGCCGCATCA TCGAGATCTA CGGCCCTGAA
AGTTCCGGCA AGACCACGCT GACCCTGCAC TGCATTGCCG AAGAGCAGAA AAAGGGTGGT
GTCTGCGCGT TTGTGGACGC AGAGCACGCG CTGGACCCAC AATACGCCCG GAAACTTGGT
GTAGATCTGG ACGAGTTGCT GATTTCTCAG CCCGATACCG GCGAGCAGGC TCTCGAGATT
ACGGAGACTC TCGTCAGGTC CGGGGCGGTC AGCATGGTGG TCGTCGACTC GGTGGCTGCG
CTTACCCCGA AATCCGAGCT CGAAGGCGAT ATGGGGGATG CCCAAGTCGG CGCCCAGGCG
CGCCTCATGA GTCAGGCCAT GCGCAAACTG ACCGGCGCGA TATCGCGATC CAACTGCACG
GTGATCTTCA TCAACCAGAT CCGCATGAAG ATCGGTGTGA TGTTCGGCTC ACCCGAAACC
ACGTCGGGCG GCAACGCGCT GAAGTTCTAT TCGTCTGTCC GGCTCGACAT CCGCCGCATT
GGATCGGTCA AGGATCGCGA CGAAATCGTC GGGAACACCA CGAAGGTCAA GGTCGTGAAG
AACAAGGTGG CCCCGCCTTT CAAACAGGTT GAGTTCGACA TCATCTACGG GGAAGGCATC
TCCAAAATGG GCGAGTTGAT CGATCTCGGC GTGAAGGCTG GCGTCGTGCA GAAATCGGGG
TCTTGGTTCT CTTATGGAGA CGAGAGGATC GGTCAGGGAC GGGAGAACGC AAAGCAGTAT
CTCCGCGACA ACACGAGGAC GGCGCTTGAG CTCGAGGACA AGATCCGCGC AGCGCACGGG
CTGGATTTCC AGATGCCCGA CAGCGAGGCC GAGATCCTCG ACGACTGA
 
Protein sequence
MASASLLDVT NNRNADKQKA LDSALAQIER QFGKGSIMKL GADSPVAEIE ATSTGSIGLD 
IALGIGGIPK GRIIEIYGPE SSGKTTLTLH CIAEEQKKGG VCAFVDAEHA LDPQYARKLG
VDLDELLISQ PDTGEQALEI TETLVRSGAV SMVVVDSVAA LTPKSELEGD MGDAQVGAQA
RLMSQAMRKL TGAISRSNCT VIFINQIRMK IGVMFGSPET TSGGNALKFY SSVRLDIRRI
GSVKDRDEIV GNTTKVKVVK NKVAPPFKQV EFDIIYGEGI SKMGELIDLG VKAGVVQKSG
SWFSYGDERI GQGRENAKQY LRDNTRTALE LEDKIRAAHG LDFQMPDSEA EILDD