Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2280 |
Symbol | |
ID | 4078464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2397495 |
End bp | 2398535 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638007602 |
Product | hypothetical protein |
Protein accession | YP_614274 |
Protein GI | 99082120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.516476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGC TTCTCTATCT TGTTCACGGC GCGGCCGTGC AGCAGCAACA GCAGCTTTCC TACAGTGTAT TGTCCGCGCT CAAGCACGGT GTAGACGGTG TCGATCTGGT GTTGATCTGC GATGCGGCCA ACCGCCGCCC CGACCTGCCC CTGCGCCATG TTGTGATCTC GCCAGAGCAG ATCGCGCAAT GGACCGACGG CGGGCGCCGA CCAGCGCGCG CGCAGCTTCA TGCCCTGGAT CTGGCCCTTC GGGAGACAGG CGGGCCGGTG TGCTGGGCCT CGACCGATAG CGCATTCACC GCTGCGCCAG CTCGGCTCTT GGAGCGTATC ACGCCAGAGA CGCCTCTGTT TTTTGACCGG GACGGCTGGC TCACGTCTCG ACCGGAATGG AGCCCCATTA TTGACGCCTG CAAGGACTCT GCCCTCTCAG AACAGATCCA ACCCAGCACA GAGGTCTTCG ATACGGGCAT TTTGGGGCTT TCGCCCTCAG ATTTGGATTT CATAAATCAA TCATTGAGCC CCTCTTTTTG GCCCCGCATT CCCAATATTT TTGACAATTT CGAACAAATC CACCTATGCG CGCTCCTTGC GCAAAACGCA CAGGAATTGC GATTCTCTCA CGACCTAGTG CAGCGGTATC AGGGCTATAT GCGACATGTC TACCAAGGGC GGCTGGAGGC AATGTTTCCA CCCGGCGGGG CGGTGAACAC CGCGCTGGCA GCACAGTTGC CTGCTATAAC AGAGCCGCCC AAACCCTTGC ATTTGCGTCT AAAAGCCAAA GCCTATGCGC TCAGACGCGG GCTTGGGCAT GGGACGGAGT TTGGCTATCT CGCCTATCTC TGCGCCTTTG CGGCCCCGAC CCCAGAGGGG CGCAATGTCT GGGCCAATAT CGCTCTGGAC ATGATGGAGC GCTCTGCCCG CGCGCCGCAA AAGCTGGTCA AGGACCTCTC GAAGCTTGCG CCTGAGGCAT TGCCTGATGC GGAGCTTTCA CCACAAACCG AAGAAAGATG GCGCAGGTAT TGGATGAACG CCGGGCTCTA A
|
Protein sequence | MATLLYLVHG AAVQQQQQLS YSVLSALKHG VDGVDLVLIC DAANRRPDLP LRHVVISPEQ IAQWTDGGRR PARAQLHALD LALRETGGPV CWASTDSAFT AAPARLLERI TPETPLFFDR DGWLTSRPEW SPIIDACKDS ALSEQIQPST EVFDTGILGL SPSDLDFINQ SLSPSFWPRI PNIFDNFEQI HLCALLAQNA QELRFSHDLV QRYQGYMRHV YQGRLEAMFP PGGAVNTALA AQLPAITEPP KPLHLRLKAK AYALRRGLGH GTEFGYLAYL CAFAAPTPEG RNVWANIALD MMERSARAPQ KLVKDLSKLA PEALPDAELS PQTEERWRRY WMNAGL
|
| |