Gene TM1040_2279 details

Gene Information

Locus tag: TM1040_2279
Symbol:
ID: 4078463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2395984
End bp: 2397486
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 61%
IMG OID: 638007601
Product: hypothetical protein
Protein accession: YP_614273
Protein GI: 99082119
COG category: [S] Function unknown
COG ID: [COG4642] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
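
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2395984
end_bp = 2397486

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1503 500
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.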


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.416244
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTA TCAACTCTAC TCTGCTCCTG ACCTCCCTGC TCTCTTTGAA TGCCGCAGGG 
CTCAGCGCCC CCGCCTTTGC GCAGGACGAA CAGGTTCTGA CCACACAAGA CGAGATCGGT
GGCGTCTATG AGGGCGAATT CAAGGGCGGC CTGCAACATG GTCAAGGGAC CTATAAGCTG
CCAAACGCCT ATGAATATTC CGGCCAGTGG GTCGAAGGCG AGATCAAGGG TAAGGGGGTT
GCCCGTTTCC CAAATGGATC AGTCTACGAG GGTGAGTTTT CCAAGGGGAA ACCCGAAGGT
CTGGGCAAGA TCACCTTTGC CGATGGCGGC ACCTATGAAG GCGAGTGGCA AGACGGTGTG
ATCAATGGCC AAGGCATTGC GATCTATGCC AATGGGGTGC GCTACGAGGG GTCTTTTGTG
GACGCCAAAC ATGACGGGCG CGGGGTGATG CAAAACCCCG GCGGCTACCA ATACGAGGGC
GATTGGGTTG CCGGGCGCAA GGAAGGCACT GGCAAGATCA CCTACCCCGA TGGCACCACC
TATCAGGGCG GCGTCAAGGA CGGCAAGCTG CATGGTCTGG GGACGCTGGT GATGCCTGAT
GGCCTTAAAT ACGAGGGCGA ATGGGCCGAC GATCAGATGA ATGGCACCGG CGTCCTGACG
CAGCCCAATG GCGACGTCTA CGAGGGCCCG CTGGTCAACG GTCGTCGTCA GGGCGAGGGC
GTGCTGCGCT ATGCCAATGG CGATGTCTAC GAGGGCCAGT TCGACGATGA TCTGCGTCAG
GGCGAGGGCA CCTTTACTGG CACCGACGGC TATATCTACA GCGGTCAGTG GCAGGCCGGT
CAGATCGAGG GTCAGGGCAA GGTCACCTAC CCGGATGGGT CCGTCTATGA GGGCGAATTC
CGCGATGATC TGGCGCATGG GGTTGGCAAG ATCACTTACC CCGATGGCTC CACCTACGAG
GGCGAATGGG TCGCTGGCGT GATCGAAGGC AACGGCAAGG CGACCTACGC CAATGGCGCC
ATCTATGAGG GCAGCTTCAA GAACGCCAAA AACGACGGTC AGGGCGTAAT GACATCGCCC
GAAGGCTATC GTTACGAGGG CGGCTGGAAG GACAGCCTGC GCCATGGCGA GGCCAAGGTG
ACCTATGCCG ATGGATCGGT CTATGAGGGC GCGTTTGCAA ATGGCCAGCG CCATGGCTTT
GGCAAGATCA CCCGCCCAGA CGGGTTCAGC TACGAAGGCC AATGGGTCGA AGGCAAGATC
GAAGGCGAAG GCATTGCGAC CTATGCCAAC GGCGACATCT ACGAGGGCAG CTTTGTGGGG
TCCAAACGTC AGGGCCCCGG CACCATGCGC TATGCCTCCG GCCAGGAGGC CTCGGGCACT
TGGAACAATG GCGCGCTTAC CACACCAGAT GCCGCGGCCT CTGAGGCGGA TCAGAGCACG
GATCCGGCCG CCGAGGAGAC GCCTGACGCA GAGGCAGGCT CGGCTGGGGA CGAAAGCAAC
TAA
 
Protein sequence
MIRINSTLLL TSLLSLNAAG LSAPAFAQDE QVLTTQDEIG GVYEGEFKGG LQHGQGTYKL 
PNAYEYSGQW VEGEIKGKGV ARFPNGSVYE GEFSKGKPEG LGKITFADGG TYEGEWQDGV
INGQGIAIYA NGVRYEGSFV DAKHDGRGVM QNPGGYQYEG DWVAGRKEGT GKITYPDGTT
YQGGVKDGKL HGLGTLVMPD GLKYEGEWAD DQMNGTGVLT QPNGDVYEGP LVNGRRQGEG
VLRYANGDVY EGQFDDDLRQ GEGTFTGTDG YIYSGQWQAG QIEGQGKVTY PDGSVYEGEF
RDDLAHGVGK ITYPDGSTYE GEWVAGVIEG NGKATYANGA IYEGSFKNAK NDGQGVMTSP
EGYRYEGGWK DSLRHGEAKV TYADGSVYEG AFANGQRHGF GKITRPDGFS YEGQWVEGKI
EGEGIATYAN GDIYEGSFVG SKRQGPGTMR YASGQEASGT WNNGALTTPD AAASEADQST
DPAAEETPDA EAGSAGDESN
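
As a rough illustration of how the gene and protein sequences above relate, the sketch below translates a space-blocked DNA string and computes its GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of the source database. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares. Ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGATCCGTA TCAACTCTAC TCTGCTCCTG ACCTCCCTGC TCTCTTTGAA TGCCGCAGGG"
print(translate(first_line))  # MIRINSTLLLTSLLSLNAAG
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.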