Gene TM1040_3581 details


Gene Information

Locus tag: TM1040_3581
Symbol:
ID: 4075509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 631198
End bp: 632346
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 59%
IMG OID: 638005101
Product: hypothetical protein
Protein accession: YP_611812
Protein GI: 99078554
COG category:
COG ID:
TIGRFAM ID:
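
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, which is consistent with the stated gene length, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 631198
end_bp = 632346

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1149
print(protein_length_aa)  # 382
```

Both results agree with the Gene Length and Protein Length fields.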


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.106267
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.283892
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCGG ATATCCTGCA TTCGGGCTTT GATGGGCTTA AATTCACTGT CGAGACCGAT 
ATCCCGCCCG AGCTGCGCAC GGCACTGGCT GAGGCCAAGG CGCAGGCAAT CCAGACCAAT
GCCGAGACCG TTATGGAATT TGGCTCTGTG GCTCTCTCAG TGCGCCGTAC AGGCGGTTCG
GCCTTTTCTG CCCATACTGG GGAGTATGGG GCCGAGTGGT ACTTTCTCGA CCCGGAAAAC
CGCCCTGCAA ACAATCCCGG CATTACCGTG GACTTTCGCG CCTTCCTTCT AGCAACTGGC
GGGCTGGACG CCGCAGAGAA ACACTTTCGC ACCTGCATGG ACGCCTTCGG CATTCGTTAT
GCCGATCATC TCTTGCGCGT GAGCCGTGTG GATTATGCCA TCGACTTTCT GGCCCCTTGG
TTTGAACCAG ACCGCGAGGC TCTGGTGGTG CCACCCGGCA CACGTGTTCA GGAACACACC
GGTATTGATG AAACAGAAAC CCATGCCACC GGTGCGCGCG TCACCGGCCT GCGCGCCGGA
GCCGTCGCCA ACCGGCAGTT GGTGATCTAC GACAAGCGAC AAGAGGTTAT GCAAAAGGGC
AAGCTGGGCT GGCTCACCAT CTGGAACGAC GCCCGCGCCC AGTTGAACCG TCCGCCCCTC
GACCTCACAG ACCGGATGAC CAGCCAAGTC TGGCGCTTTG AGCTGCGCAT GGGATCCAAG
CAACTGCGCA ACCGCTGGGA AATGCGGTCA TGGCAAGACC TACGCGATAT GGTCGGAGAC
GCCTACGCCG AGTTCTGCGA AAAGATCCGC TACACCTGCC CCACCACCGA CAGCAACCGC
GCCCGCTGGC CAACACATGA CCTTTGGCGC GAGGTCGCGA GCGTGATCGC GAATGACCTA
TATGAGAATT GCTCTGGCGT GTTGCCAAGC GAGGTGATCG AGACCAACCG GGCCGAACAC
ATGCGCATGC TGGACCGGCA AATCCTTGGC CTTCTGGTGT CCCGTGCAGC AGCGTCAGAG
GTTCAGCCGC ATGAGTTCGC GGAGTTTCTA GATACGCATA TAGAAGCGAT TGAACGGATG
TCGGAAGAAC ACGCAACACC ACTGGCGGAA CGGATTAGGA AGGCGACAGA GCGGTATCGA
TTCAAATAG
 
Protein sequence
MEADILHSGF DGLKFTVETD IPPELRTALA EAKAQAIQTN AETVMEFGSV ALSVRRTGGS 
AFSAHTGEYG AEWYFLDPEN RPANNPGITV DFRAFLLATG GLDAAEKHFR TCMDAFGIRY
ADHLLRVSRV DYAIDFLAPW FEPDREALVV PPGTRVQEHT GIDETETHAT GARVTGLRAG
AVANRQLVIY DKRQEVMQKG KLGWLTIWND ARAQLNRPPL DLTDRMTSQV WRFELRMGSK
QLRNRWEMRS WQDLRDMVGD AYAEFCEKIR YTCPTTDSNR ARWPTHDLWR EVASVIANDL
YENCSGVLPS EVIETNRAEH MRMLDRQILG LLVSRAAASE VQPHEFAEFL DTHIEAIERM
SEEHATPLAE RIRKATERYR FK
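
The sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how they might be processed, the code below strips that grouping, computes GC content, and translates DNA to protein. It is not the method used to build this record, and the helper names (clean, gc_content, translate) are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares; table 11 differs mainly in the alternative start codons it permits, which does not matter here because this gene starts with ATG.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues, and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# Codon table in the conventional T, C, A, G ordering (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from above.
first_row = "ATGGAAGCGG ATATCCTGCA TTCGGGCTTT GATGGGCTTA AATTCACTGT CGAGACCGAT"
last_row = "TTCAAATAG"

print(translate(first_row))  # MEADILHSGFDGLKFTVETD
print(translate(last_row))   # FK
```

The two printed fragments match the beginning and the end of the protein sequence listed above. Passing the full gene sequence to gc_content and translate would allow the GC content and Protein Length fields to be checked the same way.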