Gene TM1040_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1831 
Symbol 
ID4076977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1927865 
End bp1928956 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content63% 
IMG OID638007146 
Productprotein of unknown function DUF900, hydrolase-like 
Protein accessionYP_613826 
Protein GI99081672 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.798525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAT TTCTGCCAGC CGTTGCCCGT TCGATATCTG TGATCCTCTG CCTTGGCGGT 
CCATTTGGCG TGATCCCCGC GCCACTGGCG GCCCAGTCTG AGGCTCAATC GGAGGCCCAA
TCTGAAACCC AGTCGGAGAT TGAGGTTGCA GAGTTTCCCT ATGTCACGCT CCGGAACCGG
ACCGGATCTG ATGATCCGGC GGAGTTCTAC GCAGGTGAGC GCAGTGATCC CAAAGCCGGC
CGCTGTCGCG TCGAAGAACT CGACCTTGGC GTGCTTGCGC CCCTTGCCGG TGTCGCTCCG
AATTTCCTGC GCGAGGAGCT GTTGCGCGTT CAGGCCATAG AGGAGGCCCC CACCGGCGCC
ATTCTGGACC AGCTCGAAGC GACCGCCGGG GCACAGGGGC CTGCGCTCTA TGTCCATGGC
TACTACATCA GTTTTGAAAA AGGCTGCCGC CGGGCTGCGC TGTTGCAGCA GAACGCGGAC
CTTGAGGGGC GGCTTTTGTG GTTCAGCTGG CCCTCGGATG GGGCCGCCGC CTATTACACG
CACGATGAGG TCGATCTCTA TTGGAGCCTG CCGGACCTCG CGGACACGAT TATCGAATTA
CACGAGCGCT TTGGCCCCGG CGAGGTTGCG GTCATGGGGC ACAGCCTCGG GGCGCGCGGG
GTCGTGCTGG CGCTGGCCGA GGTGGCCAAT CGGCGCCCCG ATATGCAGCT GGGTCAGGTC
GTGCTGCTGG CGCCGGATAT GGACTTTGGG ATCTTTGAAC GCATCCTGCC ACGCATTCGC
CCAATCGCAG AAAACCTGAC CATCTATGTC ACCAGCGGTG ACCGACCGCT TGCGCTTTCG
GCGCAAGTGC ATGGCTACCC GCGGCTCGGG GAGGCGGGAA ACCCGGTGTC GCGTCTCACG
GGCGTCGAGG TGATCGATCT GAGCGACTTG CCCAGCGAAG GCCCGACGGG GCACCTCTAT
CATATCTACA GCCAGATCGT GGGCGCGGAT CTGAGCCGGC TTTTGCGCAG CGGCGAGGGG
GCGTCCGAGC GTCCGGGCCT TGTGGCTCAG AGCAAAAACC TATGGCGCCT CAGGCCTGAA
AAACGCGAGT AG
 
Protein sequence
MKTFLPAVAR SISVILCLGG PFGVIPAPLA AQSEAQSEAQ SETQSEIEVA EFPYVTLRNR 
TGSDDPAEFY AGERSDPKAG RCRVEELDLG VLAPLAGVAP NFLREELLRV QAIEEAPTGA
ILDQLEATAG AQGPALYVHG YYISFEKGCR RAALLQQNAD LEGRLLWFSW PSDGAAAYYT
HDEVDLYWSL PDLADTIIEL HERFGPGEVA VMGHSLGARG VVLALAEVAN RRPDMQLGQV
VLLAPDMDFG IFERILPRIR PIAENLTIYV TSGDRPLALS AQVHGYPRLG EAGNPVSRLT
GVEVIDLSDL PSEGPTGHLY HIYSQIVGAD LSRLLRSGEG ASERPGLVAQ SKNLWRLRPE
KRE