Gene TM1040_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2997 
Symbol 
ID4078027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3164896 
End bp3165993 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content62% 
IMG OID638008326 
Producthypothetical protein 
Protein accessionYP_614991 
Protein GI99082837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.516476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGC TCGGTCTATG GCATAAAGAG GCGCAGTATT TCGACGCCGC TGGACTCACC 
CCGTCGGTAG AACGCAGCTG TCTCTTGTTG CGCGAAGCCG ATGCACTGTT GCCGCGGGGC
GCGCTGGTTC TGGAGGTGAG CCTGCCGGAA TTGCGCAAAC CCGAACCCCT GGTGCTGTTT
GAGCGTGGAG GCGACTGGCC TTTGCGATTT CAGCTCTCGG CGGTGCCGGG GGGCGGTATC
AATCTGGTGC TGGAGCAATA TGGCGCGGTG TTTCACCAAA CGCTGAACCC GACCAAGCGG
GGGCGTGCGG ATCAGGTGCG GCTGACCTAC AACTGGGACG CTCCGGCGTT TGAAGGCCAG
CTTGCGCTCG AATGGCTTGA TGGAGATCGG GCCGAAATCG CGGATATTCA TGCCCCACGC
CCCTGGCGGC TTGCGGATCT TGAGGCTTTG ATTGAAGGGG GGCCGCACTG CTTTATCGCC
TCCGGCGTCG AATACATTGC ACTTTCTGAT CAGCCGGAGC CGGTGGGGCC GATGCCCGGC
CTCTCGCCGT TTACACCGGT GGAAACGGAA ACGGGGGCGC GACCGATCAA GGACCTGCGT
CGAGGTGACT TGCTGCGCTG CGCAAGTGGT GATTTGGTCC CCGTCTTGCA TAAAATCGAA
CGTGAAGTGC CAGCGGTGGG CAGCTTTTGT CCCGTGCAGC TTCGGGCGCC ATATTTCGGC
CTGACACAGG ATATTACCGT CGCGCCTTTC CAACGCATGG TGCTCACCGG GTCTGAGGTT
GAATATCTCT TTGGATGTGA GGCTGTGCTT GCGCCGGCAG AAATGCTTGC GGCCACGCGC
ACCGCCCGTC GGGTCTTGCC CGCGGGGCCG ATCACGACCT ATGCGCAGGT GATCCTGCCC
GGCCATGAGG TGCCAGTTGT GGCGGGGCTC GGGGTCGAGA GCCTTTTTCT GGGGCGCATT
CGGCGCGACC GTTCCGCACT TGGGGCCAGC CTCTTTGCAG GGCTAGATCG CAACTCTCTG
CCAGAACACG CACAGCCGCG CTACCCCGTG GTGCGCGCGT TTGATGCCGC AATCCTTGCA
GAACATCGCA CCGCCTGA
 
Protein sequence
MSWLGLWHKE AQYFDAAGLT PSVERSCLLL READALLPRG ALVLEVSLPE LRKPEPLVLF 
ERGGDWPLRF QLSAVPGGGI NLVLEQYGAV FHQTLNPTKR GRADQVRLTY NWDAPAFEGQ
LALEWLDGDR AEIADIHAPR PWRLADLEAL IEGGPHCFIA SGVEYIALSD QPEPVGPMPG
LSPFTPVETE TGARPIKDLR RGDLLRCASG DLVPVLHKIE REVPAVGSFC PVQLRAPYFG
LTQDITVAPF QRMVLTGSEV EYLFGCEAVL APAEMLAATR TARRVLPAGP ITTYAQVILP
GHEVPVVAGL GVESLFLGRI RRDRSALGAS LFAGLDRNSL PEHAQPRYPV VRAFDAAILA
EHRTA