Gene TM1040_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3044 
Symbol 
ID4075138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp13418 
End bp14599 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content59% 
IMG OID638004545 
Producthypothetical protein 
Protein accessionYP_611280 
Protein GI99078022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAGT TTACGGAAAC AACGCAGGTC AGCCTGTCGG TCCGATTAAA GAACGCGTTC 
AAAGGCATCG GCCTCGGGAT CAGCTTTATC GGCATTGCTC TCTATTTTCT GTTCTGGAAC
GAAGGCAATG CCGTGCGCAC AGCGCGGGCG CTGGACGAGG GCGCGGGGCA AGTTGTGTCG
TTGGACAGCG CAACATTGGA CCCCACGTTC GAAGCCCGTC TCGTCCATAT CAGTGGCCCC
GCAAAGCTTA AAGCAGCGCT TGTTGATGCC GCGCTTGGAG TGGAGGCCCC GGCGCAAACG
GTGCGCCTCG AACGTATCGT GGAGCAATTT GCCTGGATCG AAGAAACGCA AACCTGGACC
GACACCAAAC TCGGAGGAGG GCAGGACAAA ACGACCACAT ACACCTATCG GATGGATTGG
ACCGAAACCC CCGCGAGCGG GGCCGCGTTT CGAGTGTCCG AAGGTCACAT GAACCCTCCG
ATGCCGATCC GTTCCAAAAT CCTGCGCCAG CAAGACGCAA CGGTCGGCGC TTACCGCGTG
TCAGAAGAGA TCTCTGATCT GGGCGGCGCG ACACCGGTGA TATTGACCGA AACACAGGCT
GCTGAGATCG CAGAGGCTCT GCCGCTTTCT CAAACGGCCA AGCTGGTTGC CGGGCAAGTT
GTGTTTGGTG AGACCGTCGC GCGCCCGGCA CTTGGGGACA TCCGACTGCG CTACCAAGCC
GCCAGAATTG ACAGCGCCAG CGTCATTGGC CTGCAACGCG GCAATGCTCT GGTGCCCTAT
ATCGCGCAGA ACGGTCGCAA GATCCACTTG CTCACGGAAG GAATAAAAAC CGCCGAAGAG
ATGTTTGAGA CCGCGCAGCG CGCCAACACG GCCAAGACAT GGATGCTGCG CATCGGCTTG
CTGGTTCTGC TCTTTCTAGG CTTCAAAGCG CTCTTTGGCG TTGTGGATGT GCTTGCAAGC
ATTCTGCCCG TTCTGGGCTG GGTCTCGTCG TCGGTCACCT CGCTTATCAG CGTTGCATTG
GCGTTCTGCC TTGGTGGTCT CACGATGGCA ACGGCCTGGT TCTATTATCG CCCAATCGTG
TCTCTGGCGC TGATCGCAGT TGCCTTGGCT GTTGGTCTCG TTGGCGCGCT CTGGCTGCGC
TCATCCGCAA AACATGCACC TCATCCCCCC GGAACCACGT GA
 
Protein sequence
MSQFTETTQV SLSVRLKNAF KGIGLGISFI GIALYFLFWN EGNAVRTARA LDEGAGQVVS 
LDSATLDPTF EARLVHISGP AKLKAALVDA ALGVEAPAQT VRLERIVEQF AWIEETQTWT
DTKLGGGQDK TTTYTYRMDW TETPASGAAF RVSEGHMNPP MPIRSKILRQ QDATVGAYRV
SEEISDLGGA TPVILTETQA AEIAEALPLS QTAKLVAGQV VFGETVARPA LGDIRLRYQA
ARIDSASVIG LQRGNALVPY IAQNGRKIHL LTEGIKTAEE MFETAQRANT AKTWMLRIGL
LVLLFLGFKA LFGVVDVLAS ILPVLGWVSS SVTSLISVAL AFCLGGLTMA TAWFYYRPIV
SLALIAVALA VGLVGALWLR SSAKHAPHPP GTT