Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3418 |
Symbol | |
ID | 4075592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 439560 |
End bp | 440591 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638004927 |
Product | ApbE-like lipoprotein |
Protein accession | YP_611652 |
Protein GI | 99078394 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.999205 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGAACC CTCTTCTCCT TTCTCGCCGT AGCTTTATGG TCATGCCTCT TGCCCTGATG GCCTGCAAGA AAGGGTGGTC GCTGTTGGAA CTCAGCGGGC TGACGATGGG GACAAGCTAC TCGATCGTTG CGATCGATCA CAGCAAATCC GTCGAGAAAG CAGAGCTGCA GGCGGCTGTC GACAAGGCGC TGGCGCAGGT CAACGTGCAG ATGTCCAACT GGGATGCCGC GTCCGAGGTG TCGCAGTTCA ACGCGCTGGC CGCTGGCGAG AGCCTGTCGG TCTCTGGCGA GCTGCATCAT GTGATGCAGG CTGCGCAGGA CGTGCATTTT GCAAGCGACG GTGCGTTTGA CGTGACCGTG GGTGGTCTCA TCGATCTTTG GGGCTTTGGC GCGGGTCAGA CCCGCAGCGA TCTTCCCTCC GAGGCCGAGA TTGCGGCTGC CATGGGGTGC TGCGGTCAGG CGCAGTCGGT CGAGCTTGAG GCCGGTGGCC TCAAGAAGCT TAACGCTGGC GCCGAGGTCT ATCTGTCCTC CATCGGAAAA GGGTTCGGTG TTGATCAGCT GGCCCGAGTC CTCAAGGGCT ATGGCATCAC CGACTACATG GTCGAGATCG GTGGCGATCT CTACACCGCT GGGCGCAACC CCGATGGTCA GCCCTGGCAG ATCGGGATTG AGACGCCCGA GGCTTTTGAC CGTGGCGTGA CCCAGGTGGT TGGGCTTTCT GACATGGGCA TGGCCACCTC TGGCGATTAC CGCAACTTCT TTGACGTCGA TGGCAAGCGC TACTCGCATA TCATCGACGC CACCACGGGC CGTCCGGTGG AACACGACAC CGCGTCTGTC ACCGTTCTGA CCGACAACGC GATGCTGGCG GATGCATGGG CGACAGCGAT GCTGGTGCTG GGTCGCGAGC GCGGTCTCGA GATCGCAAAT CAGCGTGATC TCGCGGTTCT GTTCCTTGAT CGTGCGGTTG CAAATGGCGA CAATGGGTTT ACCTCTGTCG CATCAAACCG TTTCGAGGCG CTGACCGCGT AA
|
Protein sequence | MSNPLLLSRR SFMVMPLALM ACKKGWSLLE LSGLTMGTSY SIVAIDHSKS VEKAELQAAV DKALAQVNVQ MSNWDAASEV SQFNALAAGE SLSVSGELHH VMQAAQDVHF ASDGAFDVTV GGLIDLWGFG AGQTRSDLPS EAEIAAAMGC CGQAQSVELE AGGLKKLNAG AEVYLSSIGK GFGVDQLARV LKGYGITDYM VEIGGDLYTA GRNPDGQPWQ IGIETPEAFD RGVTQVVGLS DMGMATSGDY RNFFDVDGKR YSHIIDATTG RPVEHDTASV TVLTDNAMLA DAWATAMLVL GRERGLEIAN QRDLAVLFLD RAVANGDNGF TSVASNRFEA LTA
|
| |