Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3714 |
Symbol | |
ID | 4075421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 773015 |
End bp | 774634 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005234 |
Product | hypothetical protein |
Protein accession | YP_611943 |
Protein GI | 99078685 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.114307 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTGA TCCTCTCCGC CCTTGAAATC CTGATGCGCT GGGACGTGGC GCTGGCGCTG CTGGCAGGTT CTGTTGGCGG GGTACTGATC GGGGCGATCC CCGGCGTAGG CCCGGCCGTT GCCATCGCGA TCCTGCTACC TGCGACCTTT TCGCTCGATC CGATCGTGGG GCTCACGGTG CTCTTGGGGA TCTATGGGTC CTCCATGTAT GGCGGTGCGA TCCCGGCGAT CCTGATCAAT ACGCCCGGCA CTGCGGTCAA CGCGCTGACG ACCTATGACG GCTACCCGAT GACGGCCCGC GGCGAGCCTC GGCGCGCCCT TAGCCTTGCC TATTCCGCGA GCTTCTTTGG CGGCATCTTT TCGGTGATCT GTCTGATCCT CTTTGCGCCC GTACTGGCCA AGGTGGCGCC GCTCTTTGGC GCGCGCGAAA TTTTTCTCGC AGCCCTTCTC GGACTCATTC TGGTGGTGGT CGCACACCGG GGCCAAGCCC TGATAGCCGG GGCGCTCGCG TGCCTTGGGA TCTTTCTCAA TACGATCGGC ATGGAGCCGG TGAAATACAC CCAACGCTAC ACCTTTGGAA CGGATGCGCT GGGGTCAGGC ATCAATCTCA TCGTGGTGGT GCTGGGGCTT TTTGCGATCT CGCAGGCCTT TATCCTCTTG ACCGATGAGG ATGAGAAAAT CCGCCTCACC AAGCTGCGCG GTGGAGTGTT TCAAGGTCTG CGCGAGCTGG CGCGCCACCC GCGGGTGGCG TCTGTTTCGG CGGGCTTTGG CGTGGTGATG GGAATGATCC CGGGCGTGGG TGAGTTCACG GCGCAGTTCA TGTCCTATAC CTACGCGCAA AAGACCTCAA AACGGCCGCA GGATTTTGGC AATGGCTCCT CTGAGGGGCT GATCGCTGCC GAAACCGCCA ATAACGCCGT GCCCGCTGCT GCCATGGTGC CGCTTCTCGC GCTTGGCATT CCGGGCGAGG CGCTGACAGC GATGATGCTT TCGGTCTTTT ACGTCCACAA TGTCGTGCCA GGGCCGGGGC TGTTCCAAAA CCAGATGGAT TTTGTCGTCG CGCTTTATCT GGCGCTCTTG ATCCTCAATG TGCTTGTGCT TGTCTTTCTG CTGGCGGCCA CCAAATCGCT GGTTCAAGTG GTGCGCATTC CCAACCGCTT CCTGGGTGTC GGCATTCTCA CGCTCAGCTT TGTGGGGGTC TATTCGCTGC GCAACTCGGT CACCGATTGC TTCATGGCAG CCGGGTTTGG GCTCTTTGGC TTTATCCTCA AGCGGTTGCA GCTGCCCGCC GTGCCGATCA TCCTCGGCAT GGTTCTGGGC GGCATCATGG AAGTGAAACT GCGCGCCGCC ATGGCACGGG TAAAGACTCC CTTTGATTTC ATTGATCGCC CGGTGGCGTT CATCCTGTTT TCCCTCATCT TGATCGTGCT TGCGGCGCAT CTCTTTCGTA TCGTGAAAGA GGCGCGCGAG CTGAAGGAGG CCGAATGGCA AACCAAGACC TCATCAACAC GCTTCGGAGC ACTGCTGTTG CGCCGCAGAG CCTCTTTCTT GAAGGAACAT GGCAAGAAGG TTCGGGCGCG CCATGCGAGA CTACTTCGCC GATTGACGGA TCTGTTCTGA
|
Protein sequence | MDLILSALEI LMRWDVALAL LAGSVGGVLI GAIPGVGPAV AIAILLPATF SLDPIVGLTV LLGIYGSSMY GGAIPAILIN TPGTAVNALT TYDGYPMTAR GEPRRALSLA YSASFFGGIF SVICLILFAP VLAKVAPLFG AREIFLAALL GLILVVVAHR GQALIAGALA CLGIFLNTIG MEPVKYTQRY TFGTDALGSG INLIVVVLGL FAISQAFILL TDEDEKIRLT KLRGGVFQGL RELARHPRVA SVSAGFGVVM GMIPGVGEFT AQFMSYTYAQ KTSKRPQDFG NGSSEGLIAA ETANNAVPAA AMVPLLALGI PGEALTAMML SVFYVHNVVP GPGLFQNQMD FVVALYLALL ILNVLVLVFL LAATKSLVQV VRIPNRFLGV GILTLSFVGV YSLRNSVTDC FMAAGFGLFG FILKRLQLPA VPIILGMVLG GIMEVKLRAA MARVKTPFDF IDRPVAFILF SLILIVLAAH LFRIVKEARE LKEAEWQTKT SSTRFGALLL RRRASFLKEH GKKVRARHAR LLRRLTDLF
|
| |