Gene TM1040_3797 details


Gene Information

Locus tag: TM1040_3797
Symbol:
ID: 4074948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 49911
End bp: 51461
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 52%
IMG OID: 638004456
Product: hypothetical protein
Protein accession: YP_611191
Protein GI: 99077932
COG category:
COG ID:
TIGRFAM ID:
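
The coordinate and length fields above are consistent with each other, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 49911
END_BP = 51461


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1551
print(protein_length(length_bp))  # 516
```

Both results match the Gene Length (1551 bp) and Protein Length (516 aa) fields.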


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.134421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCTG GGACGGTTTA TGTAGACCGG CCGGGAACTG GAATTTTTGC AGTCGAATTA 
GGGCCTAGCG GGGCCGGAGC GCTGCTGAAT GAACGCGGCA AACGCGAGCT TGAGATGGTT
TCATATGGTC CACGCCAGAT TTGTCTTCAG GAGGAATGTG TCAATGTGCA GATCGAAGAT
GATCGACTTG CGCTTTATAA GGACAACAGT GACCGGCTCG AGGGTTTTCT GATCTATATT
GCCCTCGGGC GTTTTACGTC ACCGCTTCCG CTTGCACCTG TTAAGGACGC GCAGCAATCA
CAAGAGATAG CTAACGTTGA CGTTTCAAAT GCGGAAGCTA AGCTGGATAT CTACGACACA
GCGGAGTCCA CAGGCAAGGC AATGGACGAA GCTCTTGCCA GCGAAGCCTG CACTGATGGG
GCTTTGTTTT ATGGTACTAG TGACTGTAAA GCAGCGATGG AACGGGCGGT TTCACAAGAC
ATTGGGCGGA TCGTATCCTC TGCGACGGGG CCAACAAATG CAGAGAACGA AAGCCAACCG
GTACTAGATT TGCCCGAAGT AGAATACCTA TGGTTCGGCC GCTCGGCTGA TCTGCAGGTC
CAAAAGCTCG GTACGCGGGA TAAACGCGGA CAAACACGTG TGGATTTTGA AAACGGATCC
CCACGGATCG GTGGGCGTAT ACTTATACCC TCATTTGTAC CTGACAGTGC ACGCATCGTG
GCGGTAAGCG AAGCGTTGGC CTTAATTCGT TTTTCGGGCT CATTATTGCC CAAAGGGTGT
GTTGAAGCAT ACCGCTGGCT TTGGGTTCAG GCATCTGACC ACGGTGTTCT GTCAGAGCCG
TTTGGAGCCT GTACCGCTGC GGAGGATGTC GAGGTACATT ACGAGGGATC TCGGGTCGTG
ACGACTATCA CTCCTGAAGA GGGTATTCCG AGTACCTTCG AAATATTCCC ATACCGAAGT
GACGACATTG ATCCATTGCG GGTATCCGTC ACTTCGGGTC CGTCTGTCGA TTTTGAGCCC
GTGTCTGAAG AAGATTGGAA AACCATTGAG CGCCGTGCAG CAGAGGCACA ACGCATCGCT
ACCGCCGAAG CGGCGAAAGA AGAAGCTCAG ATCCGAGAGG CGGATATGCA AGCCGAAAAA
GAAGCTGCCC TTGCTAGAAG GCGCCAACCT GCAACCCCCA CCGGGAAGCT CGACGATGGA
AATGTCTTTG ACATCCTCGC TCAGGATTCA GTGCAGTCAG CGATTGCTGC CTCCGACGAT
GCAAAAATTA TCCAAAAAGC TTTATCAGAT CGTTTCTATG AGACTGTCTA TCTACCGCAC
AACAAGCACG TTGGTGACAT TTACGTCGGA CTTTCCTGCG GCCCATCGGG ATGCGCCGAA
CTGATGGCTG GCGCTATATA CAATCGAGTC ACGCAAGACG CCTTCGGGTT TGTACAGATC
GACTTTGAAA CTTACAGGTT TGGATCAGAA GGTTGGCTTA TGGCGGACCC TTCGGCGCAG
GTTGTGACAG ATACCTTGAG CAAAATGGTC AAAGCTATTC CCGCTGAGTA G
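
The GC content field in the gene information can be checked against the gene sequence, which is printed in blocks of ten bases separated by spaces. The following is a minimal sketch. The helper name `gc_content` is hypothetical, and how the database rounds the reported percentage is an assumption not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to print the
    sequence in blocks) is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example with the first line of the gene sequence; paste the full
# sequence here to compare against the reported GC content.
first_line = "ATGCTCCCTG GGACGGTTTA TGTAGACCGG CCGGGAACTG GAATTTTTGC AGTCGAATTA"
print(f"{gc_content(first_line):.0%}")
```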
 
Protein sequence
MLPGTVYVDR PGTGIFAVEL GPSGAGALLN ERGKRELEMV SYGPRQICLQ EECVNVQIED 
DRLALYKDNS DRLEGFLIYI ALGRFTSPLP LAPVKDAQQS QEIANVDVSN AEAKLDIYDT
AESTGKAMDE ALASEACTDG ALFYGTSDCK AAMERAVSQD IGRIVSSATG PTNAENESQP
VLDLPEVEYL WFGRSADLQV QKLGTRDKRG QTRVDFENGS PRIGGRILIP SFVPDSARIV
AVSEALALIR FSGSLLPKGC VEAYRWLWVQ ASDHGVLSEP FGACTAAEDV EVHYEGSRVV
TTITPEEGIP STFEIFPYRS DDIDPLRVSV TSGPSVDFEP VSEEDWKTIE RRAAEAQRIA
TAEAAKEEAQ IREADMQAEK EAALARRRQP ATPTGKLDDG NVFDILAQDS VQSAIAASDD
AKIIQKALSD RFYETVYLPH NKHVGDIYVG LSCGPSGCAE LMAGAIYNRV TQDAFGFVQI
DFETYRFGSE GWLMADPSAQ VVTDTLSKMV KAIPAE