Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3797 |
Symbol | |
ID | 4074948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 49911 |
End bp | 51461 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638004456 |
Product | hypothetical protein |
Protein accession | YP_611191 |
Protein GI | 99077932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.134421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCTG GGACGGTTTA TGTAGACCGG CCGGGAACTG GAATTTTTGC AGTCGAATTA GGGCCTAGCG GGGCCGGAGC GCTGCTGAAT GAACGCGGCA AACGCGAGCT TGAGATGGTT TCATATGGTC CACGCCAGAT TTGTCTTCAG GAGGAATGTG TCAATGTGCA GATCGAAGAT GATCGACTTG CGCTTTATAA GGACAACAGT GACCGGCTCG AGGGTTTTCT GATCTATATT GCCCTCGGGC GTTTTACGTC ACCGCTTCCG CTTGCACCTG TTAAGGACGC GCAGCAATCA CAAGAGATAG CTAACGTTGA CGTTTCAAAT GCGGAAGCTA AGCTGGATAT CTACGACACA GCGGAGTCCA CAGGCAAGGC AATGGACGAA GCTCTTGCCA GCGAAGCCTG CACTGATGGG GCTTTGTTTT ATGGTACTAG TGACTGTAAA GCAGCGATGG AACGGGCGGT TTCACAAGAC ATTGGGCGGA TCGTATCCTC TGCGACGGGG CCAACAAATG CAGAGAACGA AAGCCAACCG GTACTAGATT TGCCCGAAGT AGAATACCTA TGGTTCGGCC GCTCGGCTGA TCTGCAGGTC CAAAAGCTCG GTACGCGGGA TAAACGCGGA CAAACACGTG TGGATTTTGA AAACGGATCC CCACGGATCG GTGGGCGTAT ACTTATACCC TCATTTGTAC CTGACAGTGC ACGCATCGTG GCGGTAAGCG AAGCGTTGGC CTTAATTCGT TTTTCGGGCT CATTATTGCC CAAAGGGTGT GTTGAAGCAT ACCGCTGGCT TTGGGTTCAG GCATCTGACC ACGGTGTTCT GTCAGAGCCG TTTGGAGCCT GTACCGCTGC GGAGGATGTC GAGGTACATT ACGAGGGATC TCGGGTCGTG ACGACTATCA CTCCTGAAGA GGGTATTCCG AGTACCTTCG AAATATTCCC ATACCGAAGT GACGACATTG ATCCATTGCG GGTATCCGTC ACTTCGGGTC CGTCTGTCGA TTTTGAGCCC GTGTCTGAAG AAGATTGGAA AACCATTGAG CGCCGTGCAG CAGAGGCACA ACGCATCGCT ACCGCCGAAG CGGCGAAAGA AGAAGCTCAG ATCCGAGAGG CGGATATGCA AGCCGAAAAA GAAGCTGCCC TTGCTAGAAG GCGCCAACCT GCAACCCCCA CCGGGAAGCT CGACGATGGA AATGTCTTTG ACATCCTCGC TCAGGATTCA GTGCAGTCAG CGATTGCTGC CTCCGACGAT GCAAAAATTA TCCAAAAAGC TTTATCAGAT CGTTTCTATG AGACTGTCTA TCTACCGCAC AACAAGCACG TTGGTGACAT TTACGTCGGA CTTTCCTGCG GCCCATCGGG ATGCGCCGAA CTGATGGCTG GCGCTATATA CAATCGAGTC ACGCAAGACG CCTTCGGGTT TGTACAGATC GACTTTGAAA CTTACAGGTT TGGATCAGAA GGTTGGCTTA TGGCGGACCC TTCGGCGCAG GTTGTGACAG ATACCTTGAG CAAAATGGTC AAAGCTATTC CCGCTGAGTA G
|
Protein sequence | MLPGTVYVDR PGTGIFAVEL GPSGAGALLN ERGKRELEMV SYGPRQICLQ EECVNVQIED DRLALYKDNS DRLEGFLIYI ALGRFTSPLP LAPVKDAQQS QEIANVDVSN AEAKLDIYDT AESTGKAMDE ALASEACTDG ALFYGTSDCK AAMERAVSQD IGRIVSSATG PTNAENESQP VLDLPEVEYL WFGRSADLQV QKLGTRDKRG QTRVDFENGS PRIGGRILIP SFVPDSARIV AVSEALALIR FSGSLLPKGC VEAYRWLWVQ ASDHGVLSEP FGACTAAEDV EVHYEGSRVV TTITPEEGIP STFEIFPYRS DDIDPLRVSV TSGPSVDFEP VSEEDWKTIE RRAAEAQRIA TAEAAKEEAQ IREADMQAEK EAALARRRQP ATPTGKLDDG NVFDILAQDS VQSAIAASDD AKIIQKALSD RFYETVYLPH NKHVGDIYVG LSCGPSGCAE LMAGAIYNRV TQDAFGFVQI DFETYRFGSE GWLMADPSAQ VVTDTLSKMV KAIPAE
|
| |