Gene Information |
Locus tag | Haur_4222 |
Symbol | |
ID | 5736076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5378563 |
End bp | 5383836 |
Gene Length | 5274 bp |
Protein Length | 1757 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281377 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546982 |
Protein GI | 159900735 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
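The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_4222.
length = gene_length_bp(5378563, 5383836)
print(length)                     # 5274
print(protein_length_aa(length))  # 1757
```

Both results match the Gene Length and Protein Length rows of the record.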
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.236113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTCA CCACGATCCA AGCACTCTAC GATGAAGCAC GGTCTGCTCT CGAAACAGGG AAGGAGGAGC GTGCCATTGG TGCGAGTGAG CATCTGCTTG AATCGTTTCC CTATTATCTT GAAGCGTACC GTATCCTCGG CGAATCGTAT CTTAACCGCC AAGATTTAGC CAAAGCCGTC GAAGCCTTTG AACGGGTCTT ACGTTCTGAC CCCGAAAATA TCCCTGTCCA TGTGGGCTTG GGGGTAACTT ACGAGCGTCA AGGTAACCTC GCAGCGGCGA TCCGCGAGTT TGAGCAAGCA TTTGAAATTA AGCCCGATTT GCCTGAGTTG CGCTCGCAAG TGTTGCGTTT GTATACCGAG GCTTGGGGTA GCGAAAACGC CCGAATCTTG CTGAAAAAGG CAGGTCTCGG GCGTATGTAT GTGCGCGGAC GACGCTTCGA TAAGGCGATT CAGGAATTTA ACGATGTGCT CGCCGATGAC CCCAAACGGG TGGATATTGC GGTTGCGCTG GCTGAAGCAC TCTGGCGTAA CGGCCAAGAG GCTGAAGCCG CCGAAGTTGC CAGCGATATT CTGCGCGATT ACCCCGATAT GCTCAAGGCT AACTTGATTT TGGGCTATCA TTTGTTGGCC GCTGGCGACC CCAAAGGCCG CAAGTTGTGG CAACATGCCC AGCAACTCGA CCCCAGTCAA GGCGTGGCCT ATGCCTTATT CGATGGCATG CTGCCACCAG TCGAAGCCCC CGATACCAAA ATCGAGGCCT TCGATGAAGC CGCTTGGAAA GCCAAAAAAG CTGAAAAGGC CGAAAAAGAG CGGCTTGCCC GCGAAGCTGC CGAACAAGCC GAGCGCGATC GCTTGGCCGC CGAGCAAGCG AGCAAAGCGC CAGCCGCCTC TTGGTTGGCC GAAGTTGAGG CTGCCCCGGT TGCAGTCGGG GTTGCGGCTA ACGCCGATGA TGATTTCTTG CGTAGCTTGT TGTTTGGCGA TTTTGGGGCA GCTCCTGCTA GTCCTGCACC CACAGCGGTT GCAATCGCCG AGCCAGAATA CGATGTCGAT TTCAATCTCG ACGACTTGGG CTTAGATCTT CAGCCCTTCT CGCTCGATGA AGTTGATGAC ACCCCCAGCA ATAAAGCGCC AACGCCTGCG CCCGAGCCTG TGGTTGCTCC AACCAAGCCT GCTCCTGAGC CAGTAGTTGC CAGCAATGAT GATTTTGAGT TGCCGGGCGA TCTAACGCCC TTCAGTTTTG GCGATTGGGA TGAAAGCAGC ATCGATGATA TTCCGGGCAC TGGCGCGGAT ACTGGCAAGT TGCCTGAGCA ATTGCAGCCA TTCTCCTTGG AAAATTTTGA TGATGTACAA CCTGAAACCA GCTCGAAAGC CCAAGATGAT GATTTGGCTT TACCAAATAC CTTAAAGCCA TTCTCACTTG ATGAACTGAG CTTTGATTCA ATCGACACAC CATCAGAGCC AAGCATGCCC TTCAGCCGCG ATTTACCCAG CTTCCCCGAT ACCGATCAAG AATCAGGTGG TTTTAGCTGG CAACAACCAC GTTCACGTTC GCGCTCGATT TTTGGGGCAC AACCTGAACC AGAGTTCAAC CAAGATGAAG AAGATGATGC TGGAATTTTC AACCGCATGG TGATCAAGAA GCAGACGCAA ACCCTGCCAC CATTGGTTGA GCCAGTCTAT GATCCTGCCG ATTCTGACGA TGAGGCAATG AATTTCTTCT CGAATGATGA TGTTGATTTG CGTAATTACG ATGAGCATGA TCTGCACACT GCTGATACTG AGCCAGCAAT TACGCCATTC 
TCATTAACTG AATTGGGCTT GGATGCTGAT GAAATTGCTG ATTACAATGC CATGGGCAAT CTCAGCCAAC CGCAAACCCA AGTCGATGAA GAATTGGAAA TTAAGCCCTT CTCGCTGACT GAACTTGGCT TGAGCGACGA TGAAATTGCC TTGCTGCAAG CTGGCGAAGA TAGCGCTTTG CCCACCGATC CAACTAACGA TGAATCGGGC TTAACCCCAT TTTCCTTAAA TGAATTTGGT TTTGACACGC CTGCGACTAA CGATGATCAA GGCTTCAACT TCGATGAGCC AAGTATCACG CCATTCTCGC TCGATAATTT AGGGCTTGAT CCTGAGGAGC AAGCGCTTTA TTCAGGCGAG TTCAATCCAG CACCTGTGGC TGAACCCACG CCCAAGGTGG AAGAAGAGCC AAATATGACT CCCTTCTCCT TGACCGATTT GGGACTTGAT GCTGATGAAA TTGCTCAATT TGAAGCAATG AATCAAGCAC CAGCTGCCAT CGATGATGAT GGTTTTGATG GTGGTTTGCA GCCATTCTCG CTCGATGATT TGGGCTTTGA TCAAGAACCT CAAGCCTATG AAGAGCCAGT GCGCGAACTA AGTGATAGCT CACAACCCTT CTCCTTGAAC GATATTGGTT TATCGGCAGA GGAAATTGCG GCAATCGAGC AAGCTGGCCA AAATCAAAGC GATGATCCAA TCTTTGATGC GCTTTTGGGA ATTGGTCAAC AGCAAGGCTA TGTCGATTTG ACCGATATCA TCAATCAATT CGATGATCCT GAATCGCAAA CCGAGGAAAT CGACCGCATC GCTTTGGCCT TGCACGATAA TGGTATTCAA ATTCGCGATG GCGATGAAGT GATTAATATG GACGAGGAGT TCTCTGGTGA CGAGGCTGAA GCCTACGAAA CCGAAGAACC ACTTGAAAAC TTCATTGCTG GCGATTTCGA CCAGCCAACT GCTGAAGTCG AGCCAGAAAT GACTCCGTTC TCGTTGACCG ATTTGGGCTT GAGTGCCGAA GAAATCGCGA TGTTGAATGG CGAAACCAGC GAAGCGCCGA CTGAAGAGCC AGCACCATTC TCCTTCGATA ATTTCGAGCT TGAGCAACCC AGTGCCGAAG TCGAGCCGGA AATGACTCCG TTCTCGTTGA CCGATTTGGG CTTGAGTGCC GAAGAAATTG CGATGTTGAA TGGCGAAACC AGTGAGCCTG CTGCCGAAGA ACCAGCACCA TTCTCCTTCG ATAACTTTGA GCTTGAGCAA CCCGCAACGG ACGCTGAGCC AGAAATGACT CCGTTCTCGC TGACCGATTT GGGCTTGAGC GCCGAAGAAA TTGCCATGTT GAATGGTGAA ACCAGCGAAG CGGCGGCAGA AGAACCAGCG CCATTCTCCT TCGATAATTT CGAGCTTGAC CAACCCGCAA CGGATGCTGA GCCAGAAATG ACTCCGTTCT CGCTGACCGA TTTGGGCTTG AGCGCTGAAG AAATCGCCAT GTTGAATGGC GAAACCAGCG AAGCGGCGGC AGAAGAACCA GCACCATTCT TCTTCGATAA TTTCGAGCTT GAGCAACCCG CAACGGACGC TGAGCCAGAA ATGACTCCGT TCTCGCTGAC CGATTTGGGC TTGAGCGCCG AAGAAATTGC GATGTTGAAT GGCGATACCA GCGAAGCGGC GACCGAAGAG CCAGCACCAT TCTCCTTCGA TAATTTCGAG CTTGACCAAC CTGCAACGGA TGCTGAGCCA GAAATGACTC CGTTCTCACT GACCGATTTG GGCTTGAGCG CTGAAGAAAT CGCGATGTTG AATGGCGAAA 
CCAGCGAAGC AGCGGCGGAA GAACCAGCAC CATTCTCCTT CGATAATTTC GAGCTTGACC AACCCGCAAC GGATGCTGAG CCAGAAATGA CTCCGTTCTC GTTGACCGAT TTGGGCTTGA GCGCCGAAGA AATCGCAATG CTGAATGGTG AAACCAGCGA AGTGGCGGCG GAAGAACCAG CACCATTCTC GTTTGATAAT TTCGAACTTG AGCAACCAAG TGCGGAAGCC GAGCCAGAAA TGACTCCGTT CTCGCTGACC GATTTGGGCT TGAGCGCCGA AGAAATCGCG ATGCTGAATG GCGAAACGAG CGAAGCGGCG GCAGAAGAAC CAGCACCATT CTCCTTCGAT AATTTTGAGC TTGACCAACC CGCAACGGAC GCTGAGCCAG AAATGACTCC GTTCTCGTTG ACCGATTTGG GCTTGAGTGC CGAAGAAATT GCCATGCTGA ATGGTGAAAC CAGCGAAGCG GCGGCAGAAG AACCAGCTCG ATTCGCATTC GATGATTTTG AGTTTGAGCA ACCAAGTGCC GAAGCTGAGC CAGAAATGAC TCCGTTCTCG CTGACCGATT TGGGCTTGAG CGCCGAAGAA ATTGCTGCTC TGGAAGGCAC AAGCGCACCT GAACCTGAGA TTGAAGCTGA AGAGCCTGAT ATGTCGCCCT TCACCTTTGA TCAACTAGGC TTGAGCGCCG AAGAAATTGC CGCATTGGAA GGTAACGAGG CTCCAGCAGC CGAGCCAGCC AGCGAGCTAG AGCCTGATCT CACGCCATTC TCGCTAGCCG ATTTGGGCTT GAGCGATGAT GAAATTGCTT CATTGCAACA AGGCGACGAT GATCGGAGCT TGAAGCTGAG CGAAGAAGAA TTGATGGGCA TCGATTTTGC TTTGCCAAGC GCCGAGCCAG AACCAGAGCC AGTGGTCGAA GCAGCGCCAG CCGAACCCGA AATGACCCCA TTCTCGTGGG AAGATTTGGG CTTGAGCGAC GACGAAATTG CCATAATTGA ATCGCCAGCC GAGCCAGTGG TTGAGGTTAC GCCAGTTGTG GTCGTGCCAA CGCCGCCTGC AATTGTCGAA ACCCCACCAG CAATTAGCGC TACACCAACG CCAAAAGCTG AGCCAGAGGC TGATAACAAA CGTGCTCCGT TGTATCCAGT TTCATCACGG CCACGTGAAG AAGAACAACG CCCACGCGCA CCGTTGTATA ACGTCTCGCC ACGCCGTGAA CCAAAACCAG CGGCTGAAAC GCCAGTGGTT GAAACCTCAC CCGTGGTGGC TCAAACCCCA ACACCAGTAG CAGTAGCACC TGTGGCTAGC CCAACAGTTA CGCCAAGCAG TGGTGGGGCC AGCGGCGATG GCCCAGACTT TAGCGAATAC TACCAACAAC TTGAGGCCGA CCCAAACAAC CATGGCTTGC GAATGGCCTT GGCGCGGATG ATTAGTCAGA CCGCCTCGGT TGATCAAGCC TTAAACGAAT ACAAACGGCT GATTAAACAA AATCAACTGA TGGATCAAGT AGTTGATGAT CTGCAAGATC TGATTGAGTC GCACGATGAC CCAGGCTTGC TCCAACGGCT GCACCGTGCT TTAGGTGATG CCTATTCTAA GCAAGGACGC TGGCGTGAAG CGATGGATGA GTATGGCTGG GTGCTAAATA AACCACGTCG TTAA
|
Protein sequence | MALTTIQALY DEARSALETG KEERAIGASE HLLESFPYYL EAYRILGESY LNRQDLAKAV EAFERVLRSD PENIPVHVGL GVTYERQGNL AAAIREFEQA FEIKPDLPEL RSQVLRLYTE AWGSENARIL LKKAGLGRMY VRGRRFDKAI QEFNDVLADD PKRVDIAVAL AEALWRNGQE AEAAEVASDI LRDYPDMLKA NLILGYHLLA AGDPKGRKLW QHAQQLDPSQ GVAYALFDGM LPPVEAPDTK IEAFDEAAWK AKKAEKAEKE RLAREAAEQA ERDRLAAEQA SKAPAASWLA EVEAAPVAVG VAANADDDFL RSLLFGDFGA APASPAPTAV AIAEPEYDVD FNLDDLGLDL QPFSLDEVDD TPSNKAPTPA PEPVVAPTKP APEPVVASND DFELPGDLTP FSFGDWDESS IDDIPGTGAD TGKLPEQLQP FSLENFDDVQ PETSSKAQDD DLALPNTLKP FSLDELSFDS IDTPSEPSMP FSRDLPSFPD TDQESGGFSW QQPRSRSRSI FGAQPEPEFN QDEEDDAGIF NRMVIKKQTQ TLPPLVEPVY DPADSDDEAM NFFSNDDVDL RNYDEHDLHT ADTEPAITPF SLTELGLDAD EIADYNAMGN LSQPQTQVDE ELEIKPFSLT ELGLSDDEIA LLQAGEDSAL PTDPTNDESG LTPFSLNEFG FDTPATNDDQ GFNFDEPSIT PFSLDNLGLD PEEQALYSGE FNPAPVAEPT PKVEEEPNMT PFSLTDLGLD ADEIAQFEAM NQAPAAIDDD GFDGGLQPFS LDDLGFDQEP QAYEEPVREL SDSSQPFSLN DIGLSAEEIA AIEQAGQNQS DDPIFDALLG IGQQQGYVDL TDIINQFDDP ESQTEEIDRI ALALHDNGIQ IRDGDEVINM DEEFSGDEAE AYETEEPLEN FIAGDFDQPT AEVEPEMTPF SLTDLGLSAE EIAMLNGETS EAPTEEPAPF SFDNFELEQP SAEVEPEMTP FSLTDLGLSA EEIAMLNGET SEPAAEEPAP FSFDNFELEQ PATDAEPEMT PFSLTDLGLS AEEIAMLNGE TSEAAAEEPA PFSFDNFELD QPATDAEPEM TPFSLTDLGL SAEEIAMLNG ETSEAAAEEP APFFFDNFEL EQPATDAEPE MTPFSLTDLG LSAEEIAMLN GDTSEAATEE PAPFSFDNFE LDQPATDAEP EMTPFSLTDL GLSAEEIAML NGETSEAAAE EPAPFSFDNF ELDQPATDAE PEMTPFSLTD LGLSAEEIAM LNGETSEVAA EEPAPFSFDN FELEQPSAEA EPEMTPFSLT DLGLSAEEIA MLNGETSEAA AEEPAPFSFD NFELDQPATD AEPEMTPFSL TDLGLSAEEI AMLNGETSEA AAEEPARFAF DDFEFEQPSA EAEPEMTPFS LTDLGLSAEE IAALEGTSAP EPEIEAEEPD MSPFTFDQLG LSAEEIAALE GNEAPAAEPA SELEPDLTPF SLADLGLSDD EIASLQQGDD DRSLKLSEEE LMGIDFALPS AEPEPEPVVE AAPAEPEMTP FSWEDLGLSD DEIAIIESPA EPVVEVTPVV VVPTPPAIVE TPPAISATPT PKAEPEADNK RAPLYPVSSR PREEEQRPRA PLYNVSPRRE PKPAAETPVV ETSPVVAQTP TPVAVAPVAS PTVTPSSGGA SGDGPDFSEY YQQLEADPNN HGLRMALARM ISQTASVDQA LNEYKRLIKQ NQLMDQVVDD LQDLIESHDD PGLLQRLHRA LGDAYSKQGR WREAMDEYGW VLNKPRR
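The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of NCBI translation table 11 (identical to the standard code for sense codons), strips the spaces that separate the 10-character groups, and ignores alternative start codons; the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last codons of the gene are used here; the 52% GC content reported above is not recomputed.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, keeping '*' for stop codons."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence from the record.
print(translate("ATGGCTCTCA CCACGATCCA AGCACTCTAC"))  # MALTTIQALY
# Last six codons of the gene sequence, including the stop codon.
print(translate("AATAAACCACGTCGTTAA"))                # NKPRR*
```

The first output matches the start of the protein sequence, and the second matches its final residues followed by the stop codon.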