Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0785 |
Symbol | |
ID | 7315046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 844532 |
End bp | 845398 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643615665 |
Product | type IV pilus prepilin peptidase PilD |
Protein accession | YP_002512864 |
Protein GI | 220933965 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.460819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAC TCCTGCAAAC CAGCACGGCG GCACTGCTGA CCGTCACCGC ACTGTTCAGC CTGCTGGTGG GCAGCTTTCT CAATGTGGTG ATTCATCGTC TGCCGGTGAT GATGGAGCGG CAGTGGCGGC GTGAAGCCAC GGAATTCCTG TACGCCGAGG GCCAGACGCT TGCGCCAGCC GAGCAGGGGC CCGCCTACAA CCTCTGGAGC CCCCGTTCCG CCTGTCCCCA GTGCGGTCAC AAGATCACCG CGCTGGAGAA CATCCCGATC CTCAGCTACC TGCTCCTGCG CGGCCGTTGC AGCGAGTGCG GCACCTCGAT TCCGCTGCGC TACCCGCTGG TGGAGGCGGC CACGGCTCTG TTGTCCGTGG TAGTGGTCTG GCACTTCGGC TTCACCTGGC AGTCGGGGGC CGCGCTGCTG CTCACCTGGG CACTGATCGC GCTGGCGGTC ATCGACCTGC GCACCACCCT GCTCCCGGAC AGCATCACCC TGCCCTTCCT CTGGATCGGC CTGTTACTCA ACCTGGGCGG CCTGTTCACC GACGCCACCA GCAGCCTGAT CGGCGCCGTG GCCGGTTACC TGAGCCTCTG GCTGGTGTAT CACGGCTTCA AGCTGCTCAC CGGCAAGGAA GGCATGGGCT TCGGCGACTT CAAGCTCTTC GCCCTCATCG GCGCCTGGCT CGGCTGGCAG CTGCTGCCCC TGGTGATCCT GCTCTCATCC CTGGTAGGCG CCGTGGTGGG CATCGCGCTG ATCCTGTTCC GCGGCCGCGA CCGCGCCCAT CCCATGCCCT TCGGCCCCTA CCTGGCAGCG GCCGGCTGGA TCGCGCTGCT GTGGGGCAAC GACATCATGG CGCTGTACCT CGGGTAG
|
Protein sequence | MIELLQTSTA ALLTVTALFS LLVGSFLNVV IHRLPVMMER QWRREATEFL YAEGQTLAPA EQGPAYNLWS PRSACPQCGH KITALENIPI LSYLLLRGRC SECGTSIPLR YPLVEAATAL LSVVVVWHFG FTWQSGAALL LTWALIALAV IDLRTTLLPD SITLPFLWIG LLLNLGGLFT DATSSLIGAV AGYLSLWLVY HGFKLLTGKE GMGFGDFKLF ALIGAWLGWQ LLPLVILLSS LVGAVVGIAL ILFRGRDRAH PMPFGPYLAA AGWIALLWGN DIMALYLG
|
| |