Gene TM1040_2475 details


Gene Information

Locus tag: TM1040_2475
Symbol:
ID: 4076840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2614208
End bp: 2615395
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 60%
IMG OID: 638007799
Product: hypothetical protein
Protein accession: YP_614469
Protein GI: 99082315
COG category: [S] Function unknown
COG ID: [COG3146] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
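
The two length fields can be checked against the coordinates. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, although they reproduce the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2614208, 2615395)
print(length, protein_length_aa(length))  # 1188 395
```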


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.407377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0590595
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGG CGCAAATCGA AATCCAGGTG TTGAGCAGCC TGTCGCAGAT CGCGGCATCG 
GACTGGGACG CCTGTGCCTG CCCAGAGGCT GAGGCCGGCG GGCGACCGCT TGATCCCTTT
ACCACGCACC GGTTCCTGAG CGCGCTCGAA GACAGCGGCT CGGTAGGGCA GGGGACCGGC
TGGCAGCCGC AGTACCTTAC CTGCTATCTC GATGGGCAAC TGGTCGCCTG CGCGCCGCTC
TATGCCAAGG GGCACAGTCA GGGCGAATAT ATTTTCGATC ACAATTGGGC GCATGCCTAT
GAGCGAGCGG GTGGGCGCTA CTACCCAAAG CTGCAGGTCG CGGTGCCGTT TACCCCGGCC
ACCGGACGCA GATTTCTTGT GCGTCCAGGC TATGAAGAAA TCGGCATCTC CGCCTTGCTT
CAGGGCGCGG TACAATTGGC GTCTGACAAT CAGCTGTCCT CTCTTCATGT GACATTCTGC
ACCTCCGACG AGGCTGAGGC CGGGCGCGAA ATCGGCCTGA TGTCACGCAG CTCTCAGCAG
TTTCACTGGC TCAATGACGG CTACGCGGGG TTCGAGGCGT TCTTGGCGGC GCTCTCATCT
CGCAAGCGCA AGAACATCCG CAAGGAACGC AAACAGGCCC AGGGGTTTGG CGGCAGTATC
GAAACCTACA CCGGCGCAGA CCTGCGTTCC GAGCATTGGG ATGCTTTCTG GCGGTTCTAT
CAGGATACTG GCAGCCGAAA ATGGGGCACG CCCTATCTGA CGCGTGCGTT TTTCGAAATC
ATCCATGACA CAATGGCCGA GGACATGGCG TTGGTCTTGG CTGAGCGGGA CGGCGTGCCG
GTCGCGGGTG CGCTGAACTT TATCGGGGCC AAGACGTTGT ATGGCCGGTA CTGGGGGTGC
ATGGAACATC ACCCCTGCCT GCACTTTGAG CTGTGCTACT ATCAGGCGAT CGATCTTGCC
ATCGAGATGG GACTGGATCG GGTCGAGGCT GGCGCGCAGG GCGAGCACAA ACTGGCGCGT
GGCTATTTGC CAACCGAGAC CCACAGCCTG CATTGGGTCG CAGATCCGGG GTTTCGTGCA
GCTATCGAAC AATATCTGGA GGCAGAACGG GCTGCCGTAG GAGAAGAGAT CGAGATCCTC
ACCTCCTATG GGCCGTTCAA GAAGACCCAT GTGGAGGAAC AGGAATGA
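
The sequence is printed in groups of 10 bases, so the spacing has to be removed before any computation. The following sketch shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and how the page rounds its percentage is not stated, so an exact match with the listed value is not guaranteed.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in groups of 10."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short demonstration input.
first_line = "ATGGATCAGG CGCAAATCGA AATCCAGGTG TTGAGCAGCC TGTCGCAGAT CGCGGCATCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives the value for the whole gene.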
 
Protein sequence
MDQAQIEIQV LSSLSQIAAS DWDACACPEA EAGGRPLDPF TTHRFLSALE DSGSVGQGTG 
WQPQYLTCYL DGQLVACAPL YAKGHSQGEY IFDHNWAHAY ERAGGRYYPK LQVAVPFTPA
TGRRFLVRPG YEEIGISALL QGAVQLASDN QLSSLHVTFC TSDEAEAGRE IGLMSRSSQQ
FHWLNDGYAG FEAFLAALSS RKRKNIRKER KQAQGFGGSI ETYTGADLRS EHWDAFWRFY
QDTGSRKWGT PYLTRAFFEI IHDTMAEDMA LVLAERDGVP VAGALNFIGA KTLYGRYWGC
MEHHPCLHFE LCYYQAIDLA IEMGLDRVEA GAQGEHKLAR GYLPTETHSL HWVADPGFRA
AIEQYLEAER AAVGEEIEIL TSYGPFKKTH VEEQE
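
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The following is a simplified sketch that builds the codon table from the NCBI amino acid string for table 11 and stops at the first stop codon. It does not handle alternative start codons, which table 11 permits, because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("ATGGATCAGG CGCAAATCGA AATCCAGGTG"))  # MDQAQIEIQV
```

Translating the full gene sequence and comparing the result with the cleaned protein sequence is a quick consistency check on the record.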