Gene Haur_4022 details

Gene Information

Locus tag: Haur_4022
Symbol:
ID: 5735883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5133032
End bp: 5134918
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 48%
IMG OID: 641281172
Product: TPR repeat-containing protein
Protein accession: YP_001546782
Protein GI: 159900535
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
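
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5133032, 5134918)
print(length)                     # 1887
print(protein_length_aa(length))  # 628
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.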


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCTA ATACCAAGCA ATTAATCATT TGGGTTGGCA ATGCTGGCGA GCAACGAACT 
CAGGCTGCTC AAGCATGGTT AGCTGACCAA ACTGCGGCAC GAACTTGGTT GCTCAGCATG
AATCGAGCCA ATTTGGGCCA TTGGGCCGGC TTGAATAGCT TTGTTAGCTC GCTATTACCA
GCGATTAAGG CCCAAGCCCC CGAATTATTA GTCAAATATG ATTATGAGTT GGCCTTGGTT
TTGCCGCAAT TGCAACGTGA ATTGGCTGTG CGCAACCCAA GCCTCACCGA TATTAGCAAT
CCCCAAGAGC GCACCCGCAA TTATGCTGCT GATCGCGCCT TTCGGATTGT CCAAGGGCTG
ATTGATTTGG TCCATGAATT CCGCCAAACT GCGCCAGATC AGCCATGGGT CATTGTTTGT
GATCAGTTTG ATCAGGCTGG CTCGTTGGTA ACCATGTTTT TCAAAGAACT CTTGCGGCGT
GCCGACCCAC GCATGAGCTT AACCATTGGC TTTTTAGTTG AACCCAACCA GCCAGAATTG
CTGAACGAGT ATCACAGCTG GCATGCGCAT GTTGAGGTCG TGCAAGGCGC TTGGCAAGCT
GATCCGCCGA TTGTTATCGA TCCGGCTGAG GCCAAACGCC AGCTCGAAGT GCTGGAACGT
GAGCATACGT TTGATCCAAT TGAGATTGAA TTGCATCTGC CTCAATTGAT TCAGCTGGCA
ACTGCCGCCA ACGAGCCACG CAAACGCTTG GGTTTTATGC ACGAAGGCTT GTCGATCTGT
TCAACTCGTG GTTTATACGC CGATGCCTTG TATTATGGCG AGCCACTGCG AGTCGCCATG
GAACACGAGT TTCCCAAGAG CGTCGATTTT CGCTTGAGTG CCTACTTAAA GCTCTACAAT
TGTTATATTG GGCTAAAACA AGCTGAACCA GCCTTGGATA TTGCCGAAAC TGCGGTGAGC
ATTACTGATA ATCCTGCCCG TTTATATAGC TGGTATTATC TGATTGCAAT GATTCATGCA
CGATTTGCCG AGCCACGCGA TTTTGATAAA GCCGAGTATT ACCTCGATTT AGGCATTGAA
GCGATAAATA AAGCCGATAT TCCAGCGCAT GAAAAACTAT TTCAATCGTC ATTCAATCGC
AATGGTTTAG CATTAATTCG TCACTTCCAA AAACGCCCCT TGGAAGCGAT TGAAATTTGC
CAAGAATGTT ACAAAAATCT TGAAACTGGC TTAGATACCG AAGATCATAA GCTGCATCGC
TCGGTATTGC TCTACAATAT TGCTCAGGTT TACGATTCGC TCAAAGATTA TCCAAAAGCA
ATCGAATATT ATAGCTTGAC GATTGAAATG GACCCCAACT ACGCTGAATA TTATAATGAG
CGAGCCAATA TTTATCTGCA TATCGGCGAT TATGCTGCTG CCGAACGCGA TTACCAACGC
TCAATCGAGC TTAGCCCACC CTACACCGAA GTTTGGACAA ACCTTGGCCA GTGCTACCAA
GTTCAAGAAC ATTTTGAAAA AGCCATTGGC GCATTCAGCC GAGCGCTCGA TATTGACCCC
AAAAATGTGG TTGCGCTCAA TCATCGAGCT GAATGCTACG AGGGTTTGGG CCAAACTCAG
GCGGCAATTG CCGACTACAG TGCCAGCCTA AAGCTGAAAA CCAGCGAAGC CTCAACCTTT
GCCAACCGCG CGATTCTCTA CTACGAGTTG GGCGAAATTG AAGCTTCATT GGCCGATTTG
AATACCGCCA TCAGCCTCCA ACCCGATCTC GCTGAGCTGT ATGAAAATCG TGCGGTCGCT
TTGGAAGCGC TCGAACGCCA TGCTGAAGCT GAACACGATC GTCAACAAGC GATTTTGTTG
GCCAAAGCCC ACGAGGTTAA TGGCTAA
 
Protein sequence
MTSNTKQLII WVGNAGEQRT QAAQAWLADQ TAARTWLLSM NRANLGHWAG LNSFVSSLLP 
AIKAQAPELL VKYDYELALV LPQLQRELAV RNPSLTDISN PQERTRNYAA DRAFRIVQGL
IDLVHEFRQT APDQPWVIVC DQFDQAGSLV TMFFKELLRR ADPRMSLTIG FLVEPNQPEL
LNEYHSWHAH VEVVQGAWQA DPPIVIDPAE AKRQLEVLER EHTFDPIEIE LHLPQLIQLA
TAANEPRKRL GFMHEGLSIC STRGLYADAL YYGEPLRVAM EHEFPKSVDF RLSAYLKLYN
CYIGLKQAEP ALDIAETAVS ITDNPARLYS WYYLIAMIHA RFAEPRDFDK AEYYLDLGIE
AINKADIPAH EKLFQSSFNR NGLALIRHFQ KRPLEAIEIC QECYKNLETG LDTEDHKLHR
SVLLYNIAQV YDSLKDYPKA IEYYSLTIEM DPNYAEYYNE RANIYLHIGD YAAAERDYQR
SIELSPPYTE VWTNLGQCYQ VQEHFEKAIG AFSRALDIDP KNVVALNHRA ECYEGLGQTQ
AAIADYSASL KLKTSEASTF ANRAILYYEL GEIEASLADL NTAISLQPDL AELYENRAVA
LEALERHAEA EHDRQQAILL AKAHEVNG
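
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only: it builds the codon table by hand instead of using a bioinformatics library, it relies on translation table 11 sharing the amino-acid assignments of the standard genetic code, and it is applied only to the first 30 and last 27 nucleotides copied from the gene sequence above. The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order (third base varies fastest).
# Table 11 uses the same assignments as the standard code; it differs
# only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30_nt = "ATGACATCTA ATACCAAGCA ATTAATCATT"
last_27_nt = "GCCAAAGCCC ACGAGGTTAA TGGCTAA"

print(translate(first_30_nt))  # MTSNTKQLII
print(translate(last_27_nt))   # AKAHEVNG
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.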