Gene Information |
Locus tag | TM1040_3580 |
Symbol | |
ID | 4075508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 629292 |
End bp | 630824 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638005100 |
Product | hypothetical protein |
Protein accession | YP_611811 |
Protein GI | 99078553 |
COG category | [S] Function unknown |
COG ID | [COG4938] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
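The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length includes the terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(629292, 630824)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```

Under these assumptions both values agree with the Gene Length (1533 bp) and Protein Length (510 aa) fields.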
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00103089 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.191716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGTCG GAAGAAATAG CGCTGGCAAA AGCACTTTTT TAAGATCCCT TCCGCTGATT CGACAGTCGT TAGAGACAAG GAGTAGTGCA CCAATTCTAT GGTACGGCGA CTTCGTTGAC TTTGGAGATT TTCAAACTGC CGTATCTTCT GAGGCCAAAG ACGGCTATGC TGTGTTCTCA TTTAAAGTAA GAGACTTAGA AAAACGTCAT CGATCGACAG TCAGCCATCA TACCAATTAT CTCCTGAGAT ATCGCACACA AACAGTTAAA ATCGATGAAG CCATAGTAAA ATACTACGTT GGTGCAGAAT CTGGAAAAAC CGCACTAAAA AAAATATCCC TTGAGATCCC GAGCGAAGAC CTAATTTGTG ATATTTCATA TCGGGGAAGG TTGGGCACCA GCGGATCACT TACTGTTAAC GGAGAAGAGC TGCCTTACAT ATTGCAAAAT TTTGAAATTG TAAACTCCTC CAATGACCTA TTCAGCGCGC CATCCCTAAT TTCAAAAACC AAGGACGCAA ACTCTAACGT ACGAAGAAGA CATTCTTATC AAGATATAGC AGTAAAGGAG CTAGAGAGGT TTCTAAGGCG CGAGATACCT AGAATATCAA GCTCGGACAC CTTTAGAAAT GAAGTCGCAA ATATACTTTC TGGAACACCC CATTTGGAAG AAAAATGGAA ACGCTTATCC GAAACCGCCA ACACAGCTTC GTTTCGAAAA TACTACGAAA ATATTCAAAA GGGAACATCT GGAAAATCCA AAGATAGCAT CCTTACGATT CAACGCACCT TCCGTGCTCT TCAGGTTCTA GATGTGATAA ATGATGACTT ACAGTACTTT TTCTCTGGCG TCAGTTATTT AGGACCAGCA AGAGCGGCAG GGGAACGATT CTATCGGAAA CAAGAGCTGG AGGTTTCCGA AATACTTCCA AATGGCTCGA ATTTCCCAAT GTTTCTTGAT TCACTATCGA CTGGTCAAAA GAGGAGCTTT TCAGATTGGG TCGAGAGTAT TTTTGGTTAC GGCGTTGAGC TCAAATCACA TGAGGGACAC ATTAGCATCC ATCTAAAAGC AGGCGAAAAG TCAGTCAACG TTACCGATAC TGGTTATGGA GTATCACAGA TCTTGCCTGT TCTGGCGACG GTCTGGTGGT CAAGTAGGCG AACCACGGAA CGCTGGGGTA TACCTTATGG GCGGCGACAA GCGATAAGAA CTATAGCTAT CGAACAACCA GAGCTACACC TGCACCCGGC ACATCAGGCG AAGCTAGCGG ACGTTTTTGT GAAGGGTATT AAACTTGGGC AGTCTGAGAC CAATATTAAC ATAGTAAATA TCCTAATTGA AACGCACAGT GAGGCACTCA TCAACAGACT GGGAGAACTA ATCGAGCTGG GCAGCATATC CAGTGATGAC GTTCAAGTTA TTGTCTTCGC TGCAGAAGAC GACATAAATT CACCTACCAG AATGTCTGTG GCCAACTTTG ACAGCAACGG GGCGTTGGAA AATTGGCCCT TTGGATTCTT CAATTACTCT TGA
|
Protein sequence | MLVGRNSAGK STFLRSLPLI RQSLETRSSA PILWYGDFVD FGDFQTAVSS EAKDGYAVFS FKVRDLEKRH RSTVSHHTNY LLRYRTQTVK IDEAIVKYYV GAESGKTALK KISLEIPSED LICDISYRGR LGTSGSLTVN GEELPYILQN FEIVNSSNDL FSAPSLISKT KDANSNVRRR HSYQDIAVKE LERFLRREIP RISSSDTFRN EVANILSGTP HLEEKWKRLS ETANTASFRK YYENIQKGTS GKSKDSILTI QRTFRALQVL DVINDDLQYF FSGVSYLGPA RAAGERFYRK QELEVSEILP NGSNFPMFLD SLSTGQKRSF SDWVESIFGY GVELKSHEGH ISIHLKAGEK SVNVTDTGYG VSQILPVLAT VWWSSRRTTE RWGIPYGRRQ AIRTIAIEQP ELHLHPAHQA KLADVFVKGI KLGQSETNIN IVNILIETHS EALINRLGEL IELGSISSDD VQVIVFAAED DINSPTRMSV ANFDSNGALE NWPFGFFNYS
|
| |
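The relationship between the gene sequence, the protein sequence, and the Translation table field (11) can be sketched as follows. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is already given in coding orientation (the gene lies on the - strand), that whitespace between 10-base blocks is not significant, and that the first codon is read as methionine when it is one of the table 11 start codons. All function and variable names are hypothetical.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11,
# with codons enumerated over the bases in the order T, C, A, G.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}
# Start codons permitted by table 11 (GTG is the one used by this gene).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCTGGTCG GAAGAAATAG CGCTGGCAAA"))  # MLVGRNSAGK
```

Applying `translate_cds` to the full gene sequence and `gc_fraction` to the same string gives a way to compare against the Protein sequence and GC content fields.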