Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1049 |
Symbol | |
ID | 4078107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1127786 |
End bp | 1129018 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006353 |
Product | cytochrome B561 |
Protein accession | YP_613044 |
Protein GI | 99080890 |
COG category | [C] Energy production and conversion [S] Function unknown |
COG ID | [COG2353] Uncharacterized conserved protein [COG3038] Cytochrome B561 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.183438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.102385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGCC AAAACACCCC GCAGAGCTAC GGCTCCATCA CCAAGAGCTT TCACTGGCTC ACCGCCCTCC TGATCCTGAC CGCCTTTCCG CTTGGATATT TTGCGACCGA ATTGGCAGAG CATATCCAGA GCAGCGCGTT TGACGGCTCC CAAGCGACGA TTGATCGCGC TACGCTCTTG TTTTCGCTTC ATAAAACCAT TGGCGTTGCG GTGTTCTTTA CCGCGCTCCT CCGCATCCTC TGGGCCATCA CCCAGGAAAA GCCGGGCCTG TTGCACCCGG ATCGCAAGCT GGAGGCCTGG GCGGCAGAGA CGGCCCATTG GGTGCTCTAT GGTGCGATGG TGATTGTGCC GCTTTCGGGC TGGATTCATC ACGCCGCGAC CGATGGGTTT GCGCCCATCT GGTGGCCCTT TGGGCAAAAC CTGCCGCTGG TGCCCAAATC CGAGTTTGTC TCCAAACTGT CTTCTTCGGT GCATTTCTAC GCGATGCTGT TGCTTGGCGC GTCGATCCTG GCGCATGTGG GTGGCGCGCT CAAACATCAT GTGATCGATA AAGACAGCAC GTTGGTGCGC ATGCTGCCGG GCCGTCGCGC TCTGCCCGAG CCGCCCGCAC AACACCATTC TGCCTTGCCG CTGCTGACAG CGCTCGTGGT CTGGGGCGCG GTGATCGGTG GCAGCACAAT GCTCTTTCTA AACACCCAGA GCGCCAAAGG CACCGTGGCA CCGGTGGCAG CTGCACCGGT TGAAGGCTCG GGCTGGACTG TCGAAAACGG AACGCTGGCG ATCGAAGTGG TCCAGATGGG CAGCGCCATC ACGGGTACAT TCTCCGACTG GCGCGCCAAG ATCGACTTTG AAGAACCCGC CAGCCCCGGC CCGGCCGGTC GTGTCGAGGT CGCCATTGCC ATCCCATCGC TCACGCTCGG GTCTGTGACC GATCAGGCCA TGGGGTCGGA CTACTTTGAC GCCGAGACCT ATCCGCAGGC GACGTTTGAG GCCGAGATCA TCCAGATCGA GGGCGCGCAG TACGAGGCAA AAGGCACGCT CACCATTCGC GATCAGACGG TGGCGACCAC CCTGCCCTTC ACGCTCGATC TTGACGGGGA TACGGCCACC ATGAGCGGGC GAACGGAGGT TAACCGTCTC GATTTCAACA TCGGGACCGG CACGCAAGAC GAGGGCACCC TGGCCTTTGG CGTCGACATC ACGGTGGATC TGGTCGCGAC ACGCGCGCCC TGA
|
Protein sequence | MSRQNTPQSY GSITKSFHWL TALLILTAFP LGYFATELAE HIQSSAFDGS QATIDRATLL FSLHKTIGVA VFFTALLRIL WAITQEKPGL LHPDRKLEAW AAETAHWVLY GAMVIVPLSG WIHHAATDGF APIWWPFGQN LPLVPKSEFV SKLSSSVHFY AMLLLGASIL AHVGGALKHH VIDKDSTLVR MLPGRRALPE PPAQHHSALP LLTALVVWGA VIGGSTMLFL NTQSAKGTVA PVAAAPVEGS GWTVENGTLA IEVVQMGSAI TGTFSDWRAK IDFEEPASPG PAGRVEVAIA IPSLTLGSVT DQAMGSDYFD AETYPQATFE AEIIQIEGAQ YEAKGTLTIR DQTVATTLPF TLDLDGDTAT MSGRTEVNRL DFNIGTGTQD EGTLAFGVDI TVDLVATRAP
|
| |