Gene Information |
Locus tag | TM1040_3800 |
Symbol | |
ID | 4074951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | + |
Start bp | 53072 |
End bp | 54244 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638004459 |
Product | hypothetical protein |
Protein accession | YP_611194 |
Protein GI | 99077935 |
COG category | |
COG ID | |
TIGRFAM ID | |
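The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 53072, End bp 54244.
length_bp = gene_length(53072, 54244)
print(length_bp)                  # 1173, matching "Gene Length"
print(protein_length(length_bp))  # 390, matching "Protein Length"
```

Under these assumptions, 54244 - 53072 + 1 = 1173 bp, which is 391 codons, or 390 residues once the stop codon is excluded.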
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.945188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTCAGT TCACGGAAAC AACGCAAGTT GGCCTGTTTG CCCGGCTGAA GAACGCCATC AAAGGCATCG GGCTTGGAAT CAGCTTTATC GGTATCGCTG TGTATTTTCT GTTTTGGAAC GAAGGCAATG CCGTGCGTAC AGCTCGGGCG CTTGCCGAAG GGGCCAACCA GGTCGTGTCG GTCGACCACA CGGCAATAGA TCCGCAAAAC GAAGATCGCC TTCTGCATAT AGGCGGCCCG CTTGCGCTTG AGGTGCCGCT GGCGGATGTG GCGCTGGGGG TGGTTGCGTC TGCACAAACC GTGCGTCTTG AACGCAAGGT AGAGCAATTT GCCTGGATCG AAGACAAGCA AACCAAGACG GAGACAAAAC TAGGTGGCGG GCAGGAAAAG ACCACGCGAT ACACCTATCG TCAGGGTTGG ACGGATGCGC CAGCCAGTGG AGCAGAGTTT CGGGTCCCCG AGGGGCATAT GAACCCGCCG ATGCCAATCG CATCGAAAGT GATCCGACAG CCAGAGGGCA CAATTGGTGC TTTTACCGTA GATGATGAGA TTTCAGATCT GGGCGGTTCA ACACCAATGC TGTTGGACTC ACAGCAGGCG GAGGATGTCG CGCGGGCTCT TTCGTTGCAG CAACCGGCGA AACTGGTCGC TGGGCAGGTG GTTTTTGGTG CAGATGTCAC GGCTCCGCAG CTGGGTGATA TCCGGGTGAG CTATCGCGTA TCTGAAATCG AAGAGGCGAG CGTGGTCGGT GTACAGCGCA GCGACACGTT GTTGCCCTAC ACTGCCCAAA ACGGGCGCAA GATCTACTTG GTGGCAGAAG GCTTGAAGAC TGCGGACGAG ATGTTCCAGA CAGCTGTTTC CAACAACACC TTCAAAACTT GGATGTTGCG CATCGGTCTC TTGGTCCTGC TGTTTTTGGG ATTTAAGGCG CTGTTCGGCG TCGTAGACGT AATTGCCAGC ATTCTGCCGT TTCTGGGATG GATCACGGCT TCTGTCACCT CCTTGATCAG CGTTGCTCTT ACGCTGGTTG TCGGTGGCAC CACGATAGCG ATTGCTTGGG TCTATTTCCG CCCAGTTCTG GCGCTCCTTA TCATTGCTGT CGCTTTGGCC GGAGCAGCCG CCAGCGCCTA TTGGCTGCGG AAAGCGGCGC CCGAGACACC TAAGACACCC TGA
|
Protein sequence | MSQFTETTQV GLFARLKNAI KGIGLGISFI GIAVYFLFWN EGNAVRTARA LAEGANQVVS VDHTAIDPQN EDRLLHIGGP LALEVPLADV ALGVVASAQT VRLERKVEQF AWIEDKQTKT ETKLGGGQEK TTRYTYRQGW TDAPASGAEF RVPEGHMNPP MPIASKVIRQ PEGTIGAFTV DDEISDLGGS TPMLLDSQQA EDVARALSLQ QPAKLVAGQV VFGADVTAPQ LGDIRVSYRV SEIEEASVVG VQRSDTLLPY TAQNGRKIYL VAEGLKTADE MFQTAVSNNT FKTWMLRIGL LVLLFLGFKA LFGVVDVIAS ILPFLGWITA SVTSLISVAL TLVVGGTTIA IAWVYFRPVL ALLIIAVALA GAAASAYWLR KAAPETPKTP
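The gene begins with GTG while the protein begins with M, which is expected under translation table 11 (the bacterial, archaeal and plant plastid code): GTG is one of the alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of such a translation, not the database's own pipeline. The function name `translate_cds` is hypothetical, and the code assumes an unspliced plus-strand CDS given 5' to 3', with the 10-base grouping spaces used in the record stripped before translation.

```python
from itertools import product

# Amino acids of the standard codon table in NCBI's TCAG ordering;
# translation table 11 uses the same codon-to-residue assignments.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons listed by NCBI for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove grouping spaces and newlines
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating alternative start codon encodes Met
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above, followed by its TGA stop codon.
print(translate_cds("GTGTCTCAGT TCACGGAAAC AACGCAAGTT TGA"))  # MSQFTETTQV
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should give a protein that can be compared against the full protein sequence in the same way.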