Gene TM1040_3553 details


Gene Information

Locus tag: TM1040_3553
Symbol:
ID: 4075229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 597614
End bp: 598792
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 638005065
Product: phage integrase
Protein accession: YP_611784
Protein GI: 99078526
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
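
The start, end, gene length and protein length fields above are mutually consistent if the coordinates are 1-based and inclusive and if the gene length counts the terminal stop codon while the protein length does not. Both are assumptions about how the record was built, not something the record states. A minimal sketch of that check, with illustrative function names that belong to no database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(597614, 598792)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```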


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.631113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.139012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAGA CCATTTCGCC TGCGTTCACC TTCGTGAAGG ATGGCGTTTT CTACTTCAGC 
CGCCGCATCC CCAAAGACCT GAAGAAGCAC TACACCTCAC CTCGCATTGC CTACTCATTA
CGAACGCGGT CTGCTCGCAT CGCTGAAGCG CGGGCCAGAA GAGCCGCCGA TCAGTTGGAT
GAGTACTGGT ATCACCTCCG CTCAAAGGAT GTGGAACTGC CGGGTAAGCA TATGCTGAGA
ATGCAGTCCG AGGGGCGTGT CGCGGTCAGT GCAGCAGCCC CTGAGATTTC CTCTTCATCT
GTTCTTTTGT CCGAAGCTGT CGGCATCTAC CTGCGCCTGA AAGGGCAAGG ACGACCAGAG
ACATTCCACC GCGCAGCGGA ACGATCTTGC GGCTATGTCA TCGACGCCTG CGGTGACAAA
CAGCTCGACG CCTTCACCAA GGCCGATGCC AACAAGTTCA GGGATGCGCT CATCGAACGT
GGATTGGCTG GCAGCAGTAT AACACGCATA TTTGGGACCG TTCGCGCGGT CACAAACTTT
GCAGCCAGTG AGCTTGGCCT GACCCTGACC AATCCTTTCA ACGGGGTCTA CTACGACCGA
GAGGCTGGCG TAAGTGATCG CAACCCAATC CCAGCGGATG CACTCAAAGT GGTTCAGGGC
CAGTGCCGCC AACTGGACGA TGACATGAGG TGGCTTGTAG CGCTCGTGTC GGACACGGGA
ATGCGGCTTG CTGAGGCCGC AGGAATGGCG AGACAGGACA TTGAACGGCG GTCAGATGGT
TCTCTGGTTG CCTGGGTGCG TCCACACCCG TGGCGGCGTC TGAAAACCAA GGGCAGTGAG
CGTGTCGTCC CGTTGGAGGG GCAAGCCAAG TGGGCTGCAG AGCGTTTGCT GAGCGAGGTC
GTTGAAAGCG ATTTCCTCTT CCCGCGCTAC AACAGAAAGC CCCAGACAAA TGCCAACGCT
GCCAGCGCTG CACTGAACAA ATGGATAAAG CAAATGACAC CAGAAGGATG CACTATGCAC
AGCTTCCGCC ATTCGATGCG GGATCGTCTT CGCGCCGTTG AGTGTCCTTC CGACATCGTC
GATCAGATCG GGGGCTGGCA GACTGATGGT GTCGGCCACG GGTATGGTTC TGGTTACCCG
GTTGAAGTTC TGCAGAAGTG GATGAAGGCA GTCACCTGA
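
The GC content field can be recomputed from a sequence in this layout. The record does not say how its 57% figure was derived or rounded, so the sketch below assumes the plain fraction of G and C bases over all bases, and ignores the spaces and line breaks used for display. The function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percent of G and C bases, ignoring whitespace used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First display row of the gene sequence; paste all rows to check the whole gene.
first_row = "ATGTCGGAGA CCATTTCGCC TGCGTTCACC TTCGTGAAGG ATGGCGTTTT CTACTTCAGC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```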
 
Protein sequence
MSETISPAFT FVKDGVFYFS RRIPKDLKKH YTSPRIAYSL RTRSARIAEA RARRAADQLD 
EYWYHLRSKD VELPGKHMLR MQSEGRVAVS AAAPEISSSS VLLSEAVGIY LRLKGQGRPE
TFHRAAERSC GYVIDACGDK QLDAFTKADA NKFRDALIER GLAGSSITRI FGTVRAVTNF
AASELGLTLT NPFNGVYYDR EAGVSDRNPI PADALKVVQG QCRQLDDDMR WLVALVSDTG
MRLAEAAGMA RQDIERRSDG SLVAWVRPHP WRRLKTKGSE RVVPLEGQAK WAAERLLSEV
VESDFLFPRY NRKPQTNANA ASAALNKWIK QMTPEGCTMH SFRHSMRDRL RAVECPSDIV
DQIGGWQTDG VGHGYGSGYP VEVLQKWMKA VT
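
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hand-rolled translator, not the tool that produced this record. It builds the codon table from the standard T, C, A, G ordering, which gives the same codon-to-residue assignments for translation table 11 as for the standard code (the tables differ in permitted start codons, which this sketch ignores). Unrecognised codons are reported as X, an illustrative choice.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (TTT, TTC, TTA, ...) to its one-letter residue; '*' marks a stop.
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTCGGAGA CCATTTCGCC TGCGTTCACC TTCGTGAAGG ATGGCGTTTT CTACTTCAGC"
print(translate(first_row))  # MSETISPAFTFVKDGVFYFS
```

The output matches the first 20 residues of the protein sequence above.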