Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2020 |
Symbol | |
ID | 4710385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2223153 |
End bp | 2224313 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639856493 |
Product | type II secretion system protein E |
Protein accession | YP_001003586 |
Protein GI | 121998799 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.453944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG ATCGCGCCAC GGTCCGCCTC GTGGACCGTC TTCTTGCCGA CGCCGTGCGG CGTCGGGCCT CGGATATCCA CCTGCAACCG GAGGCCGACC GGGTGCGCGT GCGCCTGCGC ATCGACGGCC TGCTGCGCGA GGCCGAAGGG CCACCGCCGG GCCTGCGCGG ACGGGTTGCG GCACGCATCA AACTGCTGGC GGGGATGGAC GTCGCCGAGC AACGCCTGCC CCAGGACGGC CGGCTGGAGG CCCGGGACGG CGACGGGCAG CGGGTGCAGT TCCGCGTCGC CAGCTGCCCG GGCGTCCACG GCGAGAAACT GGTCCTGCGC CTGATCGAGC AGGACGCCCC GGCCACCCTC GACGCCCTGG ACCTCCCCGG CCCGGCCCGG GCGGCACTGG AGTCGGCCCT CGACCGCCCC GACGGACTGA TCCTGGTCAC CGGCCCCACC GGCTCCGGCA AGACCGCGAC CCTGCACGCC GCCCTGCGGC GGCTGAACAC CCCCGAGCGC AACATCTGCG CCGTGGAGGA TCCCGTGGAA ATGGACACCC CCGGGGTCAA CCAGGTGGCC GTGAACCGCC GCGCCGGTAT CGACTTCGCC CAGGCCCTGC GCGCCTTCCT GCGCCAGGAT CCGGACGTGA TCATGATCGG CGAGATCCGC GACGCCGAGA CCGCCGCCAT CGCCGTCAAG GCGGCCCAGA CCGGCCACTT GGTCCTCTCA ACCCTGCATA CACGCAGCGC GCCGGGCGCG GTGGAGCGCC TGGCGCAGAT GGGTCTGCCC GGCTACGACC TGGCCTCGAG CCTCTCCCTG GTGGTGGCGC AGCGCCTGGT CCGCCGCCTC TGCCCGGCCT GCCGCGAGAC CAGCAGCGCC GCCGCTCAGC CGGCGGCAGC GGAGCCGGCC GGCGTCTACC ACCCCCGCGG GTGCCCGGAG TGCCAGGACG GCTATCGCGG TCGGCGGGGG GTCTTTCAGG TCATGCCGAT GACCGACGCC GTGGCCGATG CCGTCCTGCA TGGCCCTTCG GCCCGCGAGA TCGAAGCCCG CGCCCGGGCG GCGGGCATGC CGGATCTCCA CGACGCCGGC TGGCCCCTGG TGGAGACCGG CGAGACCAGC GCCGCCGAAC TACGCCGCGT CACCCGCGAG GCCGAGCCGT GGCCCGCCTG A
|
Protein sequence | MDDDRATVRL VDRLLADAVR RRASDIHLQP EADRVRVRLR IDGLLREAEG PPPGLRGRVA ARIKLLAGMD VAEQRLPQDG RLEARDGDGQ RVQFRVASCP GVHGEKLVLR LIEQDAPATL DALDLPGPAR AALESALDRP DGLILVTGPT GSGKTATLHA ALRRLNTPER NICAVEDPVE MDTPGVNQVA VNRRAGIDFA QALRAFLRQD PDVIMIGEIR DAETAAIAVK AAQTGHLVLS TLHTRSAPGA VERLAQMGLP GYDLASSLSL VVAQRLVRRL CPACRETSSA AAQPAAAEPA GVYHPRGCPE CQDGYRGRRG VFQVMPMTDA VADAVLHGPS AREIEARARA AGMPDLHDAG WPLVETGETS AAELRRVTRE AEPWPA
|
| |