Gene TM1040_0020 details

Gene Information

Locus tag: TM1040_0020
Symbol:
ID: 4078683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 21907
End bp: 23346
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 58%
IMG OID: 638005307
Product: hypothetical protein
Protein accession: YP_612015
Protein GI: 99079861
COG category:
COG ID:
TIGRFAM ID:
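
The start, end, gene length, and protein length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence on this page ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(21907, 23346)
    print(length, protein_length(length))  # prints "1440 479"
```

Both values agree with the Gene Length and Protein Length fields of this record.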


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0116808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.937022
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCG ATTTCACATC CTTCGTGGTG CTGGCAGAGA TGCGCACCGG CTCAAACTTT 
CTCGAAGCCA ATCTGAATGC TCTTAGAGGC GTGAGCTGTC TGGGAGAGGC GTTTAATCCT
CATTTCATTG GCTATCCAAA CCAAGAGTCC ATCTTCGAGG TTTCCAGAGA AGCACGCGAT
GACGATCCGC ATGCGCTCGT GGACGCGATC CACGCGGCGG AGGGGCTGTG CGGCTTTCGC
TATTTCCATG ACCATGATCC CCGGATCCTT GAAAGGATCC TGAACGACCC GCAGGTTGCC
AAGATCATTC TGACCCGGAA CCCGCTCGAC AGCTATGTAT CGTGGAAGAT CGCTCGGACG
ACCGGGCAGT GGAAGCTGAC CAATGTAAAG CGACGCAAGG AGGCCAAGGC GGTCTTTGAT
GCTGAGGAAT TTGGCGCCCA TGTGGATGGC CTCCAAGAAT TCCAGATCGC CGTCTTGAAC
CAATTGCAAC GCACGGGGCA GACGGCTTTT TATGTCGCTT ATGAGGATCT TCAGAGCCTT
GAGGTCATGA ATGGTTTGGC GACTTGGCTA GGGGTGGATG CGCGGCTTGA TGCGCTGGAC
GACAGTCTAA AACGTCAAAA CCCCGCGCCC GTCATTGCCA AGGTCGAAAA CCCCGAGGAG
ATGACCGAGG CCCTTGCGGG GATGGACCGG TTCAACCTCA CACGCACGCC GAATTTTGAG
CCACGTCGCG GTCCCGCCGT TCCCGGATAC GTTGCGGGAG TGGTGACGCC GATCCTCTAT
ATGCCAGTGC GGAATGGGCT TGAGGCTCAT GTGAGCGAGT GGATCGGGGG GCTCGACAAA
GTCCCGGCCG AAGGGTTGAT CACCGGCATG AACCAAAAGC AGCTTCGCCA GTGGCGTCAT
GCGAATCCGG GGCACCGCAG TTTCACGGTT GTGCGTCACC CATTGGCGCG TGCGCACCAT
GTGTTCTGTA CCAAGATCCT GCCGAATGGG CCCGGAAGCA TGAAGCATAT CCGCAATACG
TTGCGGCGCC AATTTCATCT GGATATTCCG CAAGATGGCA TTGATGTGGG CTATTCTCGC
GAGAGACACC GTCAGGCGTT TGAGCAGTTT TTGACATTTC TCAAGTCCAA CCTTGCGGGT
CAGACCGCGA TCCGGGTGGA TGCACGCTGG GCAAGTCAGG CGCAGAGCAT CAGCGGGTTT
GCTGAGTTGG GTGCGCCGGA TCTGATCCTG CGTGAAGAGG ATCTTGCAAC AGATCTACCG
TGGCTTGCGC GCAAGCTGGG TCGGATGTCT CCGGCGGAGG TTCCCCCTGT GCCCGCGGAT
CAGCCGATCG CACTGGCGGA GATTTACGAT GACGCGCTCG AGGTCCTGTG CCGGTCGATT
TACACCCGCG ACTATTTGAC GTTCGGTTTT GACAGTTGGT CGCCCAGGAT CGAGCGCTAG
 
Protein sequence
MSADFTSFVV LAEMRTGSNF LEANLNALRG VSCLGEAFNP HFIGYPNQES IFEVSREARD 
DDPHALVDAI HAAEGLCGFR YFHDHDPRIL ERILNDPQVA KIILTRNPLD SYVSWKIART
TGQWKLTNVK RRKEAKAVFD AEEFGAHVDG LQEFQIAVLN QLQRTGQTAF YVAYEDLQSL
EVMNGLATWL GVDARLDALD DSLKRQNPAP VIAKVENPEE MTEALAGMDR FNLTRTPNFE
PRRGPAVPGY VAGVVTPILY MPVRNGLEAH VSEWIGGLDK VPAEGLITGM NQKQLRQWRH
ANPGHRSFTV VRHPLARAHH VFCTKILPNG PGSMKHIRNT LRRQFHLDIP QDGIDVGYSR
ERHRQAFEQF LTFLKSNLAG QTAIRVDARW ASQAQSISGF AELGAPDLIL REEDLATDLP
WLARKLGRMS PAEVPPVPAD QPIALAEIYD DALEVLCRSI YTRDYLTFGF DSWSPRIER