Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0241 |
Symbol | |
ID | 7084362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 276311 |
End bp | 277456 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643697283 |
Product | rare lipoprotein A |
Protein accession | YP_002353932 |
Protein GI | 217968698 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0797] Lipoproteins |
TIGRFAM ID | [TIGR00413] rare lipoprotein A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.812141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCAG TTCCTCCCGT GCTTCCCGCA GCCTTCGCAG CCCGCCCGCA GGCAGCGCGG GCGCCGCGTG CCCTGGTGGC GGCAGGCCTC GCCTGCGCCG TCGCGCTGCT CTCGGCCTGC AGCTCCGCGC CCAAGCGCGA AGGACCGGTC GCCGAGGTGC CGGCGCAGGC CGGAAAGCCC GGACGCAGCC CGGTGCGCGG TGGCGGCTAC TACAAGGACG ACGGCCCGGG CGACGACATC CCCGACAACC TCGACGCGAT CCCCGACGCG GAGCCGCGCG ACGAGCCCCT GCATCGCTTC GCCAACCGCC CCTACCACGT CATGGGGCAG AGCTTCGTGC CCGCCACCGA GCTGCGCCCC TTCCGCCAGC GCGGCCACGG CAGCTGGTAC GGCCGCCGCT TCCACGGCAA CCCGACCTCG AGTGGCGAAC CCTACGACAT GTACGCGATG ACGGCCGCGC ATCCGACCCT GCCGATCCCG AGCTACGCGC GCGTGACCAA CCTCGCCAAC GGCCGTTCGG TGGTGGTGCG GGTGAACGAT CGCGGCCCCT TCCTGCGCGG CCGGGTGATC GACCTGTCGT ATGCCGCGGC GCACAAGCTG GGCTATGTCA ACGGCGGCAG CGCCGAGGTC GAGGTCGAGC AGATCCTGCC CGGCGAGGCG CCCCTCGTCG CCATGGCGCG CCCGGTGCCG CCGCTGCCGG CGCGCGCCAA CGAGGGCCTC ATGCCGGGGT CGGGGCCGGC GACCGGGATG AAGGCACTCG CGCCCTTGCC GGTGGCGCAG GCTCCGCTGC CCGCGCCGGT CGCGAGCCCC CCGCGGGCAG CCGGCAGCGC GATCGCCCTC GCCCCGCCCG AGTGCGAGGA AGGCCCCGTC TGCGGTGCCG CGGGCGGCTT CGCGCCGCCG GTCGCTTCGG CGAGCGCGGG CATCTTCCTG CAGCTCGGCG CCTTCTCCTC GTTCGCCAAC GCCGAAGGCT TTCGCGACAC GGTGCGTAGC CAGGCCTCGG ATCTGGCCGA GCGTTTCGAA TTGTTCGCCG ATGGCGAACG TTTCCGCCTC CACGCCGGTC CCTACGACAC GGTCGAGGCG GCGCGCGATG CGGCCGAGCG CATGGGCAAC GTGCTCAAGC TCAAGCCCTT CGTGGTGGTG CGCTGA
|
Protein sequence | MTPVPPVLPA AFAARPQAAR APRALVAAGL ACAVALLSAC SSAPKREGPV AEVPAQAGKP GRSPVRGGGY YKDDGPGDDI PDNLDAIPDA EPRDEPLHRF ANRPYHVMGQ SFVPATELRP FRQRGHGSWY GRRFHGNPTS SGEPYDMYAM TAAHPTLPIP SYARVTNLAN GRSVVVRVND RGPFLRGRVI DLSYAAAHKL GYVNGGSAEV EVEQILPGEA PLVAMARPVP PLPARANEGL MPGSGPATGM KALAPLPVAQ APLPAPVASP PRAAGSAIAL APPECEEGPV CGAAGGFAPP VASASAGIFL QLGAFSSFAN AEGFRDTVRS QASDLAERFE LFADGERFRL HAGPYDTVEA ARDAAERMGN VLKLKPFVVV R
|
| |