Gene TM1040_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1784 
Symbol 
ID4076813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1877394 
End bp1878758 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content63% 
IMG OID638007099 
ProductDNA repair protein RadA 
Protein accessionYP_613779 
Protein GI99081625 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.199022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA CAACATTCTC CTGCTCCGCC TGCGGGGCCT CCTATTCCAA ATGGTCCGGT 
CGCTGCGAGG GCTGCGGTGA GTGGAATACG ATTTCCGAAG ACAAGGGGCT GAGCTCTGGA
GGACCGGCCA AGAAGTCCCT TGGCGCAATG CGCGGCAAAC GACTGGCGCT GAGCGATCTG
GCCACGCAAG AGACGCCTCC TCCGCGCACC CTTTGTGGTG TGGCAGAGCT TGATCGCGTC
CTTGGAGGCG GCCTGGTCGA TGCGTCGGCC ATCCTCGTGG GGGGCGATCC CGGGATCGGT
AAATCCACGC TGCTGTTGCA AGCTGCGGCG CAATTTGCAC ACGCAGGCCT GAAGACGGTC
TATGTTTCGG GAGAGGAAGC CTCGGCGCAG GTACGGATGC GCGCCCAGCG TCTGGGACTG
GCACAAGCCC CTGTCAAGCT CGCGGCGGAA ACCAACCTGC GCGATATTCT CACCACGCTT
GAGGCGGAAA AACCCCAGCT GGCCATTATC GATTCGATCC AGACCATGTG GGCCGACAAT
GTGGACAGTG CGCCGGGATC CGTCAGTCAG GTGCGCGCGG CGGCCCATGA GCTGACCACT
TTTGCCAAGA CCAATGGTGT CAGCATCATC ATGGTGGGCC ATGTCACCAA GGAAGGCCAG
ATCGCCGGGC CTCGCGTGGT CGAACATATG GTCGACACGG TCTTGTATTT CGAGGGCGAG
CGCGGCCACC AGTTCCGCAT CCTGCGTGCC GTGAAGAACC GCTTTGGCCC TGCCGACGAG
ATTGGCGTCT TTGAGATGAC CGGCGGCGGG CTGGCGCAAG TGGTGAACCC TTCGGCCCTG
TTTCTGTCCG AACGCGGCCA GCCCTCGCCC GGATCGGTGG TCTTTGCCGG TATCGAAGGC
ACCCGTCCGG TCCTCGTGGA AATGCAGGCG CTGGTGGCGC CTTCGCCCCA TTCGCAGCCC
CGCCGCGCTG TGGTGGGCTG GGACAGCTCG CGCCTTGCGA TGATCCTCGC CGTTCTGGAG
GCGCGCTGTG GCATTCCTTT TGCCGGGCTT GATGTCTATC TCAATGTTGC GGGCGGCATG
AAAATCTCTG AACCCGCGGC CGACCTGGCG GTGGCGGCCG CCCTCCTTAG TGCACGCGAG
GACACCGCCC TGCCCGCCGA TACGGCAATA TTTGGCGAAA TATCCCTATC TGGCGCGCTC
AGACCGGCCC CTCAGACCGA AAACCGGTTG AAAGAGGCGC AAAAACTTGG TTTCACGGCA
GCGATCGCTC CGAGCGGTGG CAAAACTGTT TCTGTCCCCG GCCTGACCCT GCGCCCGGCC
GCCGACCTCA CAGGATTTGT TGGCGAATAT TTCGGAGCAG GCTAA
 
Protein sequence
MAKTTFSCSA CGASYSKWSG RCEGCGEWNT ISEDKGLSSG GPAKKSLGAM RGKRLALSDL 
ATQETPPPRT LCGVAELDRV LGGGLVDASA ILVGGDPGIG KSTLLLQAAA QFAHAGLKTV
YVSGEEASAQ VRMRAQRLGL AQAPVKLAAE TNLRDILTTL EAEKPQLAII DSIQTMWADN
VDSAPGSVSQ VRAAAHELTT FAKTNGVSII MVGHVTKEGQ IAGPRVVEHM VDTVLYFEGE
RGHQFRILRA VKNRFGPADE IGVFEMTGGG LAQVVNPSAL FLSERGQPSP GSVVFAGIEG
TRPVLVEMQA LVAPSPHSQP RRAVVGWDSS RLAMILAVLE ARCGIPFAGL DVYLNVAGGM
KISEPAADLA VAAALLSARE DTALPADTAI FGEISLSGAL RPAPQTENRL KEAQKLGFTA
AIAPSGGKTV SVPGLTLRPA ADLTGFVGEY FGAG