Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4022 |
Symbol | |
ID | 5735883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5133032 |
End bp | 5134918 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281172 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546782 |
Protein GI | 159900535 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCTA ATACCAAGCA ATTAATCATT TGGGTTGGCA ATGCTGGCGA GCAACGAACT CAGGCTGCTC AAGCATGGTT AGCTGACCAA ACTGCGGCAC GAACTTGGTT GCTCAGCATG AATCGAGCCA ATTTGGGCCA TTGGGCCGGC TTGAATAGCT TTGTTAGCTC GCTATTACCA GCGATTAAGG CCCAAGCCCC CGAATTATTA GTCAAATATG ATTATGAGTT GGCCTTGGTT TTGCCGCAAT TGCAACGTGA ATTGGCTGTG CGCAACCCAA GCCTCACCGA TATTAGCAAT CCCCAAGAGC GCACCCGCAA TTATGCTGCT GATCGCGCCT TTCGGATTGT CCAAGGGCTG ATTGATTTGG TCCATGAATT CCGCCAAACT GCGCCAGATC AGCCATGGGT CATTGTTTGT GATCAGTTTG ATCAGGCTGG CTCGTTGGTA ACCATGTTTT TCAAAGAACT CTTGCGGCGT GCCGACCCAC GCATGAGCTT AACCATTGGC TTTTTAGTTG AACCCAACCA GCCAGAATTG CTGAACGAGT ATCACAGCTG GCATGCGCAT GTTGAGGTCG TGCAAGGCGC TTGGCAAGCT GATCCGCCGA TTGTTATCGA TCCGGCTGAG GCCAAACGCC AGCTCGAAGT GCTGGAACGT GAGCATACGT TTGATCCAAT TGAGATTGAA TTGCATCTGC CTCAATTGAT TCAGCTGGCA ACTGCCGCCA ACGAGCCACG CAAACGCTTG GGTTTTATGC ACGAAGGCTT GTCGATCTGT TCAACTCGTG GTTTATACGC CGATGCCTTG TATTATGGCG AGCCACTGCG AGTCGCCATG GAACACGAGT TTCCCAAGAG CGTCGATTTT CGCTTGAGTG CCTACTTAAA GCTCTACAAT TGTTATATTG GGCTAAAACA AGCTGAACCA GCCTTGGATA TTGCCGAAAC TGCGGTGAGC ATTACTGATA ATCCTGCCCG TTTATATAGC TGGTATTATC TGATTGCAAT GATTCATGCA CGATTTGCCG AGCCACGCGA TTTTGATAAA GCCGAGTATT ACCTCGATTT AGGCATTGAA GCGATAAATA AAGCCGATAT TCCAGCGCAT GAAAAACTAT TTCAATCGTC ATTCAATCGC AATGGTTTAG CATTAATTCG TCACTTCCAA AAACGCCCCT TGGAAGCGAT TGAAATTTGC CAAGAATGTT ACAAAAATCT TGAAACTGGC TTAGATACCG AAGATCATAA GCTGCATCGC TCGGTATTGC TCTACAATAT TGCTCAGGTT TACGATTCGC TCAAAGATTA TCCAAAAGCA ATCGAATATT ATAGCTTGAC GATTGAAATG GACCCCAACT ACGCTGAATA TTATAATGAG CGAGCCAATA TTTATCTGCA TATCGGCGAT TATGCTGCTG CCGAACGCGA TTACCAACGC TCAATCGAGC TTAGCCCACC CTACACCGAA GTTTGGACAA ACCTTGGCCA GTGCTACCAA GTTCAAGAAC ATTTTGAAAA AGCCATTGGC GCATTCAGCC GAGCGCTCGA TATTGACCCC AAAAATGTGG TTGCGCTCAA TCATCGAGCT GAATGCTACG AGGGTTTGGG CCAAACTCAG GCGGCAATTG CCGACTACAG TGCCAGCCTA AAGCTGAAAA CCAGCGAAGC CTCAACCTTT GCCAACCGCG CGATTCTCTA CTACGAGTTG GGCGAAATTG AAGCTTCATT GGCCGATTTG AATACCGCCA TCAGCCTCCA ACCCGATCTC GCTGAGCTGT ATGAAAATCG TGCGGTCGCT TTGGAAGCGC TCGAACGCCA TGCTGAAGCT GAACACGATC GTCAACAAGC GATTTTGTTG GCCAAAGCCC ACGAGGTTAA TGGCTAA
|
Protein sequence | MTSNTKQLII WVGNAGEQRT QAAQAWLADQ TAARTWLLSM NRANLGHWAG LNSFVSSLLP AIKAQAPELL VKYDYELALV LPQLQRELAV RNPSLTDISN PQERTRNYAA DRAFRIVQGL IDLVHEFRQT APDQPWVIVC DQFDQAGSLV TMFFKELLRR ADPRMSLTIG FLVEPNQPEL LNEYHSWHAH VEVVQGAWQA DPPIVIDPAE AKRQLEVLER EHTFDPIEIE LHLPQLIQLA TAANEPRKRL GFMHEGLSIC STRGLYADAL YYGEPLRVAM EHEFPKSVDF RLSAYLKLYN CYIGLKQAEP ALDIAETAVS ITDNPARLYS WYYLIAMIHA RFAEPRDFDK AEYYLDLGIE AINKADIPAH EKLFQSSFNR NGLALIRHFQ KRPLEAIEIC QECYKNLETG LDTEDHKLHR SVLLYNIAQV YDSLKDYPKA IEYYSLTIEM DPNYAEYYNE RANIYLHIGD YAAAERDYQR SIELSPPYTE VWTNLGQCYQ VQEHFEKAIG AFSRALDIDP KNVVALNHRA ECYEGLGQTQ AAIADYSASL KLKTSEASTF ANRAILYYEL GEIEASLADL NTAISLQPDL AELYENRAVA LEALERHAEA EHDRQQAILL AKAHEVNG
|
| |