Gene TM1040_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1642 
Symbol 
ID4075745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1745947 
End bp1747818 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content59% 
IMG OID638006955 
ProductParB-like nuclease 
Protein accessionYP_613637 
Protein GI99081483 
COG category[K] Transcription 
COG ID[COG1475] Predicted transcriptional regulators 
TIGRFAM ID[TIGR00180] ParB-like partition proteins 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGC ACAGCCCTCT GAACCATCCC GACGCCCCGC TGCAATACCT CCCGTTGTCC 
GAGCTTTACT TGCACGACAT GAACCCGCGC CAGGACACGC CTGACGACGA CGTCGCAGCG
ATGGCGGATT CAATCACCGT GAACGGTCTC TTGCAGAACC TTCTGGGCTA TCGCGACCCC
GCAAACTCCG GCGTCGGCAT TGTTGCGGGT GGCCGTCGCC TGCGCGGGTT GATCCACCTA
GGCAAGAACG GCGCGCAGAT GCTGGACAGC AAAGCACCTG ACCTGTCGGC CATTCCCGTT
CAGGTCACCG ATGATGCGTT CCTCGCCCGG GCATGGGCCG GCACGGAATC CGCAACGCAA
AAACCCTTAC ACCCGGCGGA TGAGATCCGG GCTTATGCTG CATTGGCGGA TCAGGGAAAC
TCGGCCGAGA TGATCGCGCG CACATTTGCC CAGACCGTGC GGCACGTGAA GGGGCGCCTC
GCCTTGGCCC ACCTTTGCAC AGCGACGATC GAGGCACTGC GCCGGGGCGA CATCACGCTT
GATGTTGCCA AGGCCCTCAC CCTCGCCCGT GATCCAGAGC AGGAGCTGGA AGTCCTGCAA
GGCGCGCTCG ACGGCAAAAG ACAGGAGTGG TGGGTCAAGC AGCAACTGAC CGACAGCAAA
GTTCGCGCGA GCAATTACAA GGCTGCGTAT GTCGGTCGGG ACGGCTATGT CGCCGCCGGC
GGGCGCATGC GCGATGACCT ATTCGATGAT GAATCCTATC TCGAGGATCC GGATATCCTG
GACGCCCTCT TCGAAAAGAA ACTTGCCGAC ATCGCCGAGG GTATCAAGGT GCAATTCGGC
TGGAACTGGG TCAAACCACA CTCGGAGTCT CACATCCCAT ATTCACTCAC GTCAAACCTG
ACGCAGGTGG CTGGGAAAAA GACGCCACTG TCAATGAAAG AGCAGAACGA ACAGGCCGAG
CTGCAGGACA AGGAATTCCG CGAGACGCTG ACACCTGCAG AGGCGAAGCG CCTGAATGAG
TTAGAGTCAA AGGCGAAGCC CTTCTGGGAA GAAGCGACTC GCGCGAAATG CGGAATATAC
GTCTATGTCG ATCACCGCGG AAAACTGTGC ATTGACGGCG CGTACAAGGA TCAAAAGAGC
ACCAAGAAAA ACACCAGACC TGGTGATGCC GGCGCGGAAA CCGACATTCC AAGCCCAAAG
CTGACGCAGG CCGGCGTCGA AGATCTTCGC AGGATCCAGC TATTGGCCTT GCAGACAGCC
ACCTTGGGCA AGACAGAGCT GGTTCTGGAT CTCTTCGCGT GGCAGCTCGA ATGCGGCTAC
GCGACCTACA GTGGCCTGTT CAATATCTCC CTGACCGATC CGCAGATCGA ACCTGAGGCG
GAAGGCGCAT GGACCGTAGA CGACGCGCTT TCCGATGGCA GCAATCAGCG GGAGCTCGGG
AGAACGCAGG GCAACTATGC CGAAAGCTTC AAGACCTTCC AAGCCAAGGG CAAAAAGCAC
CGCAACACGG TGCTGACCCG TCACCTGATC CGGACAATCA ACGGCCAAAC CGTTAACGCG
CTTGGAACGT CCCTCTTGGC CCCAATGGTC GAACCTGACA TCCGTTCGGT CTGGACACCG
GACGCCCCCA CCTACTTCAG CCGGATCAGC AAGGCAGCGC TCGAAAGCCT CTGGCAGGAT
CTGGTGCTAG AGGGTCGACC CGAAGCCGAC GCCGCAGACT TCAATGCCAT GAGGAAGTCC
GATAAGATTG AGAACCTGCA TCGCTTGTTC AATGACGCTG ACTTTCGCAA AGCTCTTCAG
CTGGGGCCCG AGGCCGAGCA TCGGATTGCG ACCTGGCTGC CCCCGGAGAT GGAAACCGAG
GCCGCACAAT GA
 
Protein sequence
MTKHSPLNHP DAPLQYLPLS ELYLHDMNPR QDTPDDDVAA MADSITVNGL LQNLLGYRDP 
ANSGVGIVAG GRRLRGLIHL GKNGAQMLDS KAPDLSAIPV QVTDDAFLAR AWAGTESATQ
KPLHPADEIR AYAALADQGN SAEMIARTFA QTVRHVKGRL ALAHLCTATI EALRRGDITL
DVAKALTLAR DPEQELEVLQ GALDGKRQEW WVKQQLTDSK VRASNYKAAY VGRDGYVAAG
GRMRDDLFDD ESYLEDPDIL DALFEKKLAD IAEGIKVQFG WNWVKPHSES HIPYSLTSNL
TQVAGKKTPL SMKEQNEQAE LQDKEFRETL TPAEAKRLNE LESKAKPFWE EATRAKCGIY
VYVDHRGKLC IDGAYKDQKS TKKNTRPGDA GAETDIPSPK LTQAGVEDLR RIQLLALQTA
TLGKTELVLD LFAWQLECGY ATYSGLFNIS LTDPQIEPEA EGAWTVDDAL SDGSNQRELG
RTQGNYAESF KTFQAKGKKH RNTVLTRHLI RTINGQTVNA LGTSLLAPMV EPDIRSVWTP
DAPTYFSRIS KAALESLWQD LVLEGRPEAD AADFNAMRKS DKIENLHRLF NDADFRKALQ
LGPEAEHRIA TWLPPEMETE AAQ