Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0185 |
Symbol | |
ID | 5732094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 214663 |
End bp | 215820 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277309 |
Product | PilT domain-containing protein |
Protein accession | YP_001542965 |
Protein GI | 159896718 |
COG category | [R] General function prediction only |
COG ID | [COG4956] Integral membrane protein (PIN domain superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000320699 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATTT CGGAAAAAAA GGCTGTGGCT GTAAACAGAA ATCGCTTACT TAGCATTGAT TTTTGGGTGC GCATTTTGGG GATGGTCGTG CTTGGCTATA TTGGCTGGTA TTTTGGCTCT AGCTCGGCCA GCAATCCACG AACAAGCGAT GAAACCTTAG CCATGCAGCT ACTCACACTT TCGGGGGCTG GCTTGGGTTT ATTGATATCC CATCGGATAA CCTTATACCC AATTCGCAAT ATCAACCAAC GCTTGCGTCG TAGCACGGCT CAAGAGTTGA TTGCCTTGGC GCTGGGATCA TTATTGGGTG TGATGCTCGC AGCATTATTA TCCATCCCCC TAAGCCAACT ACCTGGTCTT TTAGGCAGCT ATTCGCCGGC ACTTGCCAGT TTCTTTATTA TCTACTTTTG TGTTGTAGCC TTCGAGTATC ACAAGAAAAA TCTGGTTAAT TTTGGGGTTT CGCTGCAAAC ACCCAAAGTT CGCGCAGTTA AAGAAGCTGT CCCAATGCGT CGAACCTGTT TGGTTGATAC AAGCGCCATT ATTGATGGTC GAGTTTTGGC GGTTGTACGC AGTGGGTTTC TTGATGGGAT TTTGGTTGTG CCGCGCTTTG TGCTCAACGA ATTGCAATTA TTGGCCGATT CGAGCGATGA TATGAAGCGG ATGCGCGGTC GCCGTGGCTT GGATATGCTC GAAGAAATTC GCAAAGATGA TCAATTGCGG CTCGAAATGC CCAACGATGA TATTGCCAAT GCGCGTGGCG TTGACCAGAA GTTGGTGACC TTGGCGCTGC AAGATGGTCA TGCCTTGATT ACCAACGATA AGAATTTGAG CCAAGTTGCT GAATTACAAG GCGTGCAGGT ACTCAATTTA AATGTGCTTT CCGATGCGGT ACGCCCACCC GTTGGGGCTG GCGAAATGTT GGTTGTGAAA GTGCGTGAAG AAGGCCGCGA GCGCGAACAA GGCATTGGCT ACCTCGAAGA TGGCACTATG GTAGTCGTCG AAGATGCCCG CGAACGGATC GGTGATGAAG TTCGGGTGAT CGTCAGCCGG GTCTGGACGA ACGATCGTGG TCGCATGGTC TTTGGGCGAA TTATGGGCAG TGCTGGAGCA TTTTACGGGG GCAAAAACGA TGCGGGCAAT TATCCAGCGC GTAGCTAA
|
Protein sequence | MTISEKKAVA VNRNRLLSID FWVRILGMVV LGYIGWYFGS SSASNPRTSD ETLAMQLLTL SGAGLGLLIS HRITLYPIRN INQRLRRSTA QELIALALGS LLGVMLAALL SIPLSQLPGL LGSYSPALAS FFIIYFCVVA FEYHKKNLVN FGVSLQTPKV RAVKEAVPMR RTCLVDTSAI IDGRVLAVVR SGFLDGILVV PRFVLNELQL LADSSDDMKR MRGRRGLDML EEIRKDDQLR LEMPNDDIAN ARGVDQKLVT LALQDGHALI TNDKNLSQVA ELQGVQVLNL NVLSDAVRPP VGAGEMLVVK VREEGREREQ GIGYLEDGTM VVVEDARERI GDEVRVIVSR VWTNDRGRMV FGRIMGSAGA FYGGKNDAGN YPARS
|
| |