Gene Haur_4222 details

Gene Information

Locus tag: Haur_4222
Symbol:
ID: 5736076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5378563
End bp: 5383836
Gene Length: 5274 bp
Protein Length: 1757 aa
Translation table: 11
GC content: 52%
IMG OID: 641281377
Product: TPR repeat-containing protein
Protein accession: YP_001546982
Protein GI: 159900735
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
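
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5378563
END_BP = 5383836
GENE_LENGTH_BP = 5274
PROTEIN_LENGTH_AA = 1757


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 5274
    print(expected_protein_length(GENE_LENGTH_BP))  # 1757
```

Under these assumptions, 5383836 - 5378563 + 1 = 5274 bp, and 5274 / 3 = 1758 codons, one of which is the stop codon, leaving 1757 amino acids.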


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.236113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCTCA CCACGATCCA AGCACTCTAC GATGAAGCAC GGTCTGCTCT CGAAACAGGG 
AAGGAGGAGC GTGCCATTGG TGCGAGTGAG CATCTGCTTG AATCGTTTCC CTATTATCTT
GAAGCGTACC GTATCCTCGG CGAATCGTAT CTTAACCGCC AAGATTTAGC CAAAGCCGTC
GAAGCCTTTG AACGGGTCTT ACGTTCTGAC CCCGAAAATA TCCCTGTCCA TGTGGGCTTG
GGGGTAACTT ACGAGCGTCA AGGTAACCTC GCAGCGGCGA TCCGCGAGTT TGAGCAAGCA
TTTGAAATTA AGCCCGATTT GCCTGAGTTG CGCTCGCAAG TGTTGCGTTT GTATACCGAG
GCTTGGGGTA GCGAAAACGC CCGAATCTTG CTGAAAAAGG CAGGTCTCGG GCGTATGTAT
GTGCGCGGAC GACGCTTCGA TAAGGCGATT CAGGAATTTA ACGATGTGCT CGCCGATGAC
CCCAAACGGG TGGATATTGC GGTTGCGCTG GCTGAAGCAC TCTGGCGTAA CGGCCAAGAG
GCTGAAGCCG CCGAAGTTGC CAGCGATATT CTGCGCGATT ACCCCGATAT GCTCAAGGCT
AACTTGATTT TGGGCTATCA TTTGTTGGCC GCTGGCGACC CCAAAGGCCG CAAGTTGTGG
CAACATGCCC AGCAACTCGA CCCCAGTCAA GGCGTGGCCT ATGCCTTATT CGATGGCATG
CTGCCACCAG TCGAAGCCCC CGATACCAAA ATCGAGGCCT TCGATGAAGC CGCTTGGAAA
GCCAAAAAAG CTGAAAAGGC CGAAAAAGAG CGGCTTGCCC GCGAAGCTGC CGAACAAGCC
GAGCGCGATC GCTTGGCCGC CGAGCAAGCG AGCAAAGCGC CAGCCGCCTC TTGGTTGGCC
GAAGTTGAGG CTGCCCCGGT TGCAGTCGGG GTTGCGGCTA ACGCCGATGA TGATTTCTTG
CGTAGCTTGT TGTTTGGCGA TTTTGGGGCA GCTCCTGCTA GTCCTGCACC CACAGCGGTT
GCAATCGCCG AGCCAGAATA CGATGTCGAT TTCAATCTCG ACGACTTGGG CTTAGATCTT
CAGCCCTTCT CGCTCGATGA AGTTGATGAC ACCCCCAGCA ATAAAGCGCC AACGCCTGCG
CCCGAGCCTG TGGTTGCTCC AACCAAGCCT GCTCCTGAGC CAGTAGTTGC CAGCAATGAT
GATTTTGAGT TGCCGGGCGA TCTAACGCCC TTCAGTTTTG GCGATTGGGA TGAAAGCAGC
ATCGATGATA TTCCGGGCAC TGGCGCGGAT ACTGGCAAGT TGCCTGAGCA ATTGCAGCCA
TTCTCCTTGG AAAATTTTGA TGATGTACAA CCTGAAACCA GCTCGAAAGC CCAAGATGAT
GATTTGGCTT TACCAAATAC CTTAAAGCCA TTCTCACTTG ATGAACTGAG CTTTGATTCA
ATCGACACAC CATCAGAGCC AAGCATGCCC TTCAGCCGCG ATTTACCCAG CTTCCCCGAT
ACCGATCAAG AATCAGGTGG TTTTAGCTGG CAACAACCAC GTTCACGTTC GCGCTCGATT
TTTGGGGCAC AACCTGAACC AGAGTTCAAC CAAGATGAAG AAGATGATGC TGGAATTTTC
AACCGCATGG TGATCAAGAA GCAGACGCAA ACCCTGCCAC CATTGGTTGA GCCAGTCTAT
GATCCTGCCG ATTCTGACGA TGAGGCAATG AATTTCTTCT CGAATGATGA TGTTGATTTG
CGTAATTACG ATGAGCATGA TCTGCACACT GCTGATACTG AGCCAGCAAT TACGCCATTC
TCATTAACTG AATTGGGCTT GGATGCTGAT GAAATTGCTG ATTACAATGC CATGGGCAAT
CTCAGCCAAC CGCAAACCCA AGTCGATGAA GAATTGGAAA TTAAGCCCTT CTCGCTGACT
GAACTTGGCT TGAGCGACGA TGAAATTGCC TTGCTGCAAG CTGGCGAAGA TAGCGCTTTG
CCCACCGATC CAACTAACGA TGAATCGGGC TTAACCCCAT TTTCCTTAAA TGAATTTGGT
TTTGACACGC CTGCGACTAA CGATGATCAA GGCTTCAACT TCGATGAGCC AAGTATCACG
CCATTCTCGC TCGATAATTT AGGGCTTGAT CCTGAGGAGC AAGCGCTTTA TTCAGGCGAG
TTCAATCCAG CACCTGTGGC TGAACCCACG CCCAAGGTGG AAGAAGAGCC AAATATGACT
CCCTTCTCCT TGACCGATTT GGGACTTGAT GCTGATGAAA TTGCTCAATT TGAAGCAATG
AATCAAGCAC CAGCTGCCAT CGATGATGAT GGTTTTGATG GTGGTTTGCA GCCATTCTCG
CTCGATGATT TGGGCTTTGA TCAAGAACCT CAAGCCTATG AAGAGCCAGT GCGCGAACTA
AGTGATAGCT CACAACCCTT CTCCTTGAAC GATATTGGTT TATCGGCAGA GGAAATTGCG
GCAATCGAGC AAGCTGGCCA AAATCAAAGC GATGATCCAA TCTTTGATGC GCTTTTGGGA
ATTGGTCAAC AGCAAGGCTA TGTCGATTTG ACCGATATCA TCAATCAATT CGATGATCCT
GAATCGCAAA CCGAGGAAAT CGACCGCATC GCTTTGGCCT TGCACGATAA TGGTATTCAA
ATTCGCGATG GCGATGAAGT GATTAATATG GACGAGGAGT TCTCTGGTGA CGAGGCTGAA
GCCTACGAAA CCGAAGAACC ACTTGAAAAC TTCATTGCTG GCGATTTCGA CCAGCCAACT
GCTGAAGTCG AGCCAGAAAT GACTCCGTTC TCGTTGACCG ATTTGGGCTT GAGTGCCGAA
GAAATCGCGA TGTTGAATGG CGAAACCAGC GAAGCGCCGA CTGAAGAGCC AGCACCATTC
TCCTTCGATA ATTTCGAGCT TGAGCAACCC AGTGCCGAAG TCGAGCCGGA AATGACTCCG
TTCTCGTTGA CCGATTTGGG CTTGAGTGCC GAAGAAATTG CGATGTTGAA TGGCGAAACC
AGTGAGCCTG CTGCCGAAGA ACCAGCACCA TTCTCCTTCG ATAACTTTGA GCTTGAGCAA
CCCGCAACGG ACGCTGAGCC AGAAATGACT CCGTTCTCGC TGACCGATTT GGGCTTGAGC
GCCGAAGAAA TTGCCATGTT GAATGGTGAA ACCAGCGAAG CGGCGGCAGA AGAACCAGCG
CCATTCTCCT TCGATAATTT CGAGCTTGAC CAACCCGCAA CGGATGCTGA GCCAGAAATG
ACTCCGTTCT CGCTGACCGA TTTGGGCTTG AGCGCTGAAG AAATCGCCAT GTTGAATGGC
GAAACCAGCG AAGCGGCGGC AGAAGAACCA GCACCATTCT TCTTCGATAA TTTCGAGCTT
GAGCAACCCG CAACGGACGC TGAGCCAGAA ATGACTCCGT TCTCGCTGAC CGATTTGGGC
TTGAGCGCCG AAGAAATTGC GATGTTGAAT GGCGATACCA GCGAAGCGGC GACCGAAGAG
CCAGCACCAT TCTCCTTCGA TAATTTCGAG CTTGACCAAC CTGCAACGGA TGCTGAGCCA
GAAATGACTC CGTTCTCACT GACCGATTTG GGCTTGAGCG CTGAAGAAAT CGCGATGTTG
AATGGCGAAA CCAGCGAAGC AGCGGCGGAA GAACCAGCAC CATTCTCCTT CGATAATTTC
GAGCTTGACC AACCCGCAAC GGATGCTGAG CCAGAAATGA CTCCGTTCTC GTTGACCGAT
TTGGGCTTGA GCGCCGAAGA AATCGCAATG CTGAATGGTG AAACCAGCGA AGTGGCGGCG
GAAGAACCAG CACCATTCTC GTTTGATAAT TTCGAACTTG AGCAACCAAG TGCGGAAGCC
GAGCCAGAAA TGACTCCGTT CTCGCTGACC GATTTGGGCT TGAGCGCCGA AGAAATCGCG
ATGCTGAATG GCGAAACGAG CGAAGCGGCG GCAGAAGAAC CAGCACCATT CTCCTTCGAT
AATTTTGAGC TTGACCAACC CGCAACGGAC GCTGAGCCAG AAATGACTCC GTTCTCGTTG
ACCGATTTGG GCTTGAGTGC CGAAGAAATT GCCATGCTGA ATGGTGAAAC CAGCGAAGCG
GCGGCAGAAG AACCAGCTCG ATTCGCATTC GATGATTTTG AGTTTGAGCA ACCAAGTGCC
GAAGCTGAGC CAGAAATGAC TCCGTTCTCG CTGACCGATT TGGGCTTGAG CGCCGAAGAA
ATTGCTGCTC TGGAAGGCAC AAGCGCACCT GAACCTGAGA TTGAAGCTGA AGAGCCTGAT
ATGTCGCCCT TCACCTTTGA TCAACTAGGC TTGAGCGCCG AAGAAATTGC CGCATTGGAA
GGTAACGAGG CTCCAGCAGC CGAGCCAGCC AGCGAGCTAG AGCCTGATCT CACGCCATTC
TCGCTAGCCG ATTTGGGCTT GAGCGATGAT GAAATTGCTT CATTGCAACA AGGCGACGAT
GATCGGAGCT TGAAGCTGAG CGAAGAAGAA TTGATGGGCA TCGATTTTGC TTTGCCAAGC
GCCGAGCCAG AACCAGAGCC AGTGGTCGAA GCAGCGCCAG CCGAACCCGA AATGACCCCA
TTCTCGTGGG AAGATTTGGG CTTGAGCGAC GACGAAATTG CCATAATTGA ATCGCCAGCC
GAGCCAGTGG TTGAGGTTAC GCCAGTTGTG GTCGTGCCAA CGCCGCCTGC AATTGTCGAA
ACCCCACCAG CAATTAGCGC TACACCAACG CCAAAAGCTG AGCCAGAGGC TGATAACAAA
CGTGCTCCGT TGTATCCAGT TTCATCACGG CCACGTGAAG AAGAACAACG CCCACGCGCA
CCGTTGTATA ACGTCTCGCC ACGCCGTGAA CCAAAACCAG CGGCTGAAAC GCCAGTGGTT
GAAACCTCAC CCGTGGTGGC TCAAACCCCA ACACCAGTAG CAGTAGCACC TGTGGCTAGC
CCAACAGTTA CGCCAAGCAG TGGTGGGGCC AGCGGCGATG GCCCAGACTT TAGCGAATAC
TACCAACAAC TTGAGGCCGA CCCAAACAAC CATGGCTTGC GAATGGCCTT GGCGCGGATG
ATTAGTCAGA CCGCCTCGGT TGATCAAGCC TTAAACGAAT ACAAACGGCT GATTAAACAA
AATCAACTGA TGGATCAAGT AGTTGATGAT CTGCAAGATC TGATTGAGTC GCACGATGAC
CCAGGCTTGC TCCAACGGCT GCACCGTGCT TTAGGTGATG CCTATTCTAA GCAAGGACGC
TGGCGTGAAG CGATGGATGA GTATGGCTGG GTGCTAAATA AACCACGTCG TTAA
 
Protein sequence
MALTTIQALY DEARSALETG KEERAIGASE HLLESFPYYL EAYRILGESY LNRQDLAKAV 
EAFERVLRSD PENIPVHVGL GVTYERQGNL AAAIREFEQA FEIKPDLPEL RSQVLRLYTE
AWGSENARIL LKKAGLGRMY VRGRRFDKAI QEFNDVLADD PKRVDIAVAL AEALWRNGQE
AEAAEVASDI LRDYPDMLKA NLILGYHLLA AGDPKGRKLW QHAQQLDPSQ GVAYALFDGM
LPPVEAPDTK IEAFDEAAWK AKKAEKAEKE RLAREAAEQA ERDRLAAEQA SKAPAASWLA
EVEAAPVAVG VAANADDDFL RSLLFGDFGA APASPAPTAV AIAEPEYDVD FNLDDLGLDL
QPFSLDEVDD TPSNKAPTPA PEPVVAPTKP APEPVVASND DFELPGDLTP FSFGDWDESS
IDDIPGTGAD TGKLPEQLQP FSLENFDDVQ PETSSKAQDD DLALPNTLKP FSLDELSFDS
IDTPSEPSMP FSRDLPSFPD TDQESGGFSW QQPRSRSRSI FGAQPEPEFN QDEEDDAGIF
NRMVIKKQTQ TLPPLVEPVY DPADSDDEAM NFFSNDDVDL RNYDEHDLHT ADTEPAITPF
SLTELGLDAD EIADYNAMGN LSQPQTQVDE ELEIKPFSLT ELGLSDDEIA LLQAGEDSAL
PTDPTNDESG LTPFSLNEFG FDTPATNDDQ GFNFDEPSIT PFSLDNLGLD PEEQALYSGE
FNPAPVAEPT PKVEEEPNMT PFSLTDLGLD ADEIAQFEAM NQAPAAIDDD GFDGGLQPFS
LDDLGFDQEP QAYEEPVREL SDSSQPFSLN DIGLSAEEIA AIEQAGQNQS DDPIFDALLG
IGQQQGYVDL TDIINQFDDP ESQTEEIDRI ALALHDNGIQ IRDGDEVINM DEEFSGDEAE
AYETEEPLEN FIAGDFDQPT AEVEPEMTPF SLTDLGLSAE EIAMLNGETS EAPTEEPAPF
SFDNFELEQP SAEVEPEMTP FSLTDLGLSA EEIAMLNGET SEPAAEEPAP FSFDNFELEQ
PATDAEPEMT PFSLTDLGLS AEEIAMLNGE TSEAAAEEPA PFSFDNFELD QPATDAEPEM
TPFSLTDLGL SAEEIAMLNG ETSEAAAEEP APFFFDNFEL EQPATDAEPE MTPFSLTDLG
LSAEEIAMLN GDTSEAATEE PAPFSFDNFE LDQPATDAEP EMTPFSLTDL GLSAEEIAML
NGETSEAAAE EPAPFSFDNF ELDQPATDAE PEMTPFSLTD LGLSAEEIAM LNGETSEVAA
EEPAPFSFDN FELEQPSAEA EPEMTPFSLT DLGLSAEEIA MLNGETSEAA AEEPAPFSFD
NFELDQPATD AEPEMTPFSL TDLGLSAEEI AMLNGETSEA AAEEPARFAF DDFEFEQPSA
EAEPEMTPFS LTDLGLSAEE IAALEGTSAP EPEIEAEEPD MSPFTFDQLG LSAEEIAALE
GNEAPAAEPA SELEPDLTPF SLADLGLSDD EIASLQQGDD DRSLKLSEEE LMGIDFALPS
AEPEPEPVVE AAPAEPEMTP FSWEDLGLSD DEIAIIESPA EPVVEVTPVV VVPTPPAIVE
TPPAISATPT PKAEPEADNK RAPLYPVSSR PREEEQRPRA PLYNVSPRRE PKPAAETPVV
ETSPVVAQTP TPVAVAPVAS PTVTPSSGGA SGDGPDFSEY YQQLEADPNN HGLRMALARM
ISQTASVDQA LNEYKRLIKQ NQLMDQVVDD LQDLIESHDD PGLLQRLHRA LGDAYSKQGR
WREAMDEYGW VLNKPRR
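
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, estimate GC content, and translate the coding sequence. It is illustrative only: the helper names are hypothetical, the codon table is built by hand, and it relies on the amino-acid assignments of translation table 11 matching the standard code. Alternative start codons are not handled, which is acceptable here only because this CDS begins with ATG.

```python
# Codon table in the conventional T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGGCTCTCA CCACGATCCA AGCACTCTAC"
    print(translate(prefix))  # MALTTIQALY
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would give values to compare against the GC content and protein sequence reported in this record.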