Gene TM1040_2852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2852 
Symbol 
ID4076386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3022996 
End bp3024117 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content57% 
IMG OID638008181 
Productprotein of unknown function DUF900, hydrolase-like 
Protein accessionYP_614846 
Protein GI99082692 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0746757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.750741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACG CCATGCTGGT GCTCTGCGTC TTTCTGGGGC TTTCGGCCTG TGCAGATCGC 
GACCTCATGC CAATTGTGCC GGAGGCCGTT GAAATCGGCA CACCCTATAC CGTGTTCTCC
ACCACCACCC GCGCACAAGA AGCGGATGGC ACCTATGGTT TTAGGCGCTC CGAGCAGCTG
CGGATGATGG AAATGACCGT GACCATCCCG CCAAATCACA GCCCGGGAGA GCTCTCCTAT
CGCTACAAGA ACCCGGACCC ACACAAGAAT TTTGCAATTG CCGAAGAGCT ACCGGTCGCG
AGTGTCAGTG ACCTTCGGGC CCGGTTTCTG CGCGAGCAGC GGGAAAACAA CTGGCCTCTG
CGCGAGGTGA CAATCTTTGT GCATGGCTAT AACAGCACGC ACCCGGAAAC CGCCTATCGC
TCGGCGCAAA TGGCGCATGA CGTAGAACTT CCCGGATCCC TGGTGATGTA TTCATGGCCC
AGCCGTGGAC GTGCTTTCGG CTATGCCTAT GACATCGACA GCATGCTCTT TGCCCGCGAC
GGGCTTGAGG AGACCGTGCG GCGCGTCAAA CAGGCGGGGG CAGAGCGCAT CATTCTGGTA
GCGCATTCCA TGGGAACCGC GCTTGCGATG GAGATGTTGC GCCAGGCCGA TCTGCGAAAT
CCCGGCTGGG CGGCCCGCAC GCTCAACGGT GGGGTCATCC TGATCTCACC CGATCTCGAT
GTGGATGTAT TTGAGAGCCA GATGATGGAT CTCAAACAGG TTCCGCAGCC ATTTGGTGTG
ATGGTCTCGG AGAAAGATCA TATCCTCAAT ATTTCCGGTC GACTGCGTGG CACAAGCGAG
GGGGAGCGTC TTGGAAATAT CAAATCAGCA GAGCGGCTGA TGAAATGGCC CATTGAGGTG
ATCGACCTGA CAGCATTTAA TGCGGATGCA GCCTCTGGGC ATTTTGTGGC GGCGACATCG
CCTAGTTTGC TGGCGGTCAT ACGATCCGCC AGCAACGTCA GTCGCCTGTT CGGCCCAGTT
GATCCGACGC TGTTTCAACA GATCCTACCG CAGTCTCAAA CCATCGTAAC CGACCATGGG
AAACTCATGC TCGCCAGGCA GCAACGGCAG GAAGAGCGCT GA
 
Protein sequence
MKNAMLVLCV FLGLSACADR DLMPIVPEAV EIGTPYTVFS TTTRAQEADG TYGFRRSEQL 
RMMEMTVTIP PNHSPGELSY RYKNPDPHKN FAIAEELPVA SVSDLRARFL REQRENNWPL
REVTIFVHGY NSTHPETAYR SAQMAHDVEL PGSLVMYSWP SRGRAFGYAY DIDSMLFARD
GLEETVRRVK QAGAERIILV AHSMGTALAM EMLRQADLRN PGWAARTLNG GVILISPDLD
VDVFESQMMD LKQVPQPFGV MVSEKDHILN ISGRLRGTSE GERLGNIKSA ERLMKWPIEV
IDLTAFNADA ASGHFVAATS PSLLAVIRSA SNVSRLFGPV DPTLFQQILP QSQTIVTDHG
KLMLARQQRQ EER