Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1707 |
Symbol | |
ID | 5733594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1985838 |
End bp | 1988378 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278849 |
Product | TPR repeat-containing protein |
Protein accession | YP_001544478 |
Protein GI | 159898231 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCTT TCGATGCCGA TTCGACCTAT CTCGCAGTGC TGTTGGTGTT TGAGCCATTT GATTCTGAAT TATGGCCGCT ACTGGCTCCG GCTGCGCCAA GCGTCGAGCA ACTTCAAGCT GCTGGTTGGT TGCAAGCGAG CGCCGCTGGA TTAAGTTTGC TGACAGAGCG CCGCGAACAG CTGCAAGCCC AGTTCGATTC GGCCAGCATT CGCGCAGCCT ATCAGCAATT AATCGCTGCT GAACTGAGAT TAAGCGACAA TCAACCAGCG TTTGCCGCCC GTTTATTTGC CGATCTGCGA ACGTTTGCCG AGTTGCTGAT TCGCCAAGCG CCACATGAAC TGGCCGATCT GGTCAATCAA ATTCCAGTTG ATTTAGCGCC AACCAAGGCC GATCGCCAAC TGTTGGAGTA CTACCGTGGC CTCGCCGCTG GCCTGCGTGA TCAATATGCA GAGGCATGTA CGTTGCTCAG CCAATTATTG GCGCAACCAG ATTTGGAGCC AATGATTCGC GGGCGGGCGC TCAACTCCGA TGCCACGTTT GCCCGCTACA GTGGCAATTA TGATCGGGCG TTGGCCAATT ATCAAGCCAG TTTTGCGCTG TGGGAAGCCC AGGCTGATCC AGTTCGTCAA GCCTATGTGC TGCTGAATGA AGGTAGTTTG CGCTACCACC TGCAAGAATA TCGCGCTGCT GAACGCTGCT TGAATAGCAG TTTAGCGACC TTACAAGCCC AAAATTTATT GTATCCCCAA GCTTTGGTGC TGATCAACCT TGGTTTGTTG GCGCGTGATC GTGGCAATTG GGCGCAGGCC TTAAGCCATT TTCGGCAGGC TGAAGCTATT TTGCAAGCTG AAAATGCCAC CGATTTTCTG GGGCGCATCG CCAATAATTT GGGCGAATTA GCCTTGTTGC AGGGCCAATA TCAAGCTGCT CGTGAGCATT TTGAGCAAGC CCTAGCCCAG ATGAGCAGTC GAGTTTATCA CATTGATGCC TACTTGAATT ATGGTTTGGC CTGGCATGTT GAAGGCATGT TTGAGCAAGC TGAAACCGCC TATCGCCAAG CGCTCGATCT GGTGGAAAGC GTTGAGCGCC AAGAAATCGC CGCTTTAGTT TGGTTTCGTT TGGGCCAAGT TGCAGCGGCT CGCAATGATC ATCACCAAGC CGAGCAGCAT TATTTGCAAG CAATTGAGCT GATCGAGGCG ATGCGTGCAC CCATTTTGGC CGAAAGCTTG CAAATTAGTT TGATGGGGCG GTGGCAACAG GTCTACGAAG GGGCGGTGGC GGCCTATCTC GCTCAATCCA ATGTTGAAGC TGCCTTTGTG ATGGCGGAAA AAGCCCGTGC TCGCGCACTC AACGATTTGC TGGCCCGCAA CGGCCAAACC AACCAAGCAA TTGGCACAAT TCCCAGTTTG AGTGAGTTGC AACAAAGCTT GGCGCAGGGC AGCCTTATGC TCGATTATAT GACAATTGGT GCGGTTGGGC CTGAGGCCAG TTTGTTGGCG GCTTTGCCTG CGAGTGCCAA AGCCTTGCGC AGCTTGTTAA TCCAGCCGGC AGCAACTTGG CTATTTGCAA TTACCGCTGA GCAAGCTCAA GCCTTCAATT GCCAAATCGA CCCCAATATT CTCTTGGCGA CCTCACCATT TCAGTGTGAT GGACGACGCT TTTTGCGTCC GGCAATTTTG CAGCGCTTGC AGCAACGTTT GCTTTTGCCA GCTCAAGCTT ACTTACAACA GGCGCAACAG GTGATTATCG TGCCGCATGG AGCTTTGCAT CATGTGCCGT GGAATGCCCT GTTGCTGGGC GAACTTCAGC TTGATCTGCC TTCGACTACC ATTCCAAGCG CTGCTAGCTA TTTGCAATTG AGCCAACGCC CACCGAGCCA AGCTCCCGAG GCTTGTGGGG CATTAAGTTA CGCTGGTGGG GTTGAACCAG CCTTGGTGCA TACGCATGCC GAGGCCGAAG CAGCGGTGCA GGCACTTGGC GGCCAGCATT ATCCATTGCC AGTGCCCAAT ATTCAACAAG CGCTTGGCAA TTATCGCATT GTGCATATTG CCTGCCATGG CGTGTTTGTG CTCGACCAAC CTTTGGCTTC ATGGCTGCAA TTTGGGCCTG AGCAAACCGT CTCGGCTTTG GAGATATTAA CAACTTGGCA ATTAGCCGCT GATTTGGTGG TGTTGAGTGC TTGCCAAAGT GGTGTGAGCG AAATTGTGCG GGGCGACGAG CCATTTGGTT TGGTGCGGGC ATTTTTGGCA GTTGGCGCAC GCGCAGTGTT GGTCACACTA TGGCCAGTTG ATGATGTGGC CAGTGCGGTG TTGATGAAGC TATTTTATCA AGCCCTGCAA AGCGGTGCTG CCCCCGCCGA GGCCTTACGC CAAGCAGTGC AACAGATTCG CAGCATGCCT CAAACACAGG TAGCTATGCC GCTTGCTCCA AGCCAGCAGA CAGAGTACCC GTTCGCCGAT CCGCATTATT GGGCGGGCTA TCAGTTAATT GGCGTTGGCA GCTCGATTTC TACTGTCAAG CCAGCCACAT CAGCCGCTTG A
|
Protein sequence | MTAFDADSTY LAVLLVFEPF DSELWPLLAP AAPSVEQLQA AGWLQASAAG LSLLTERREQ LQAQFDSASI RAAYQQLIAA ELRLSDNQPA FAARLFADLR TFAELLIRQA PHELADLVNQ IPVDLAPTKA DRQLLEYYRG LAAGLRDQYA EACTLLSQLL AQPDLEPMIR GRALNSDATF ARYSGNYDRA LANYQASFAL WEAQADPVRQ AYVLLNEGSL RYHLQEYRAA ERCLNSSLAT LQAQNLLYPQ ALVLINLGLL ARDRGNWAQA LSHFRQAEAI LQAENATDFL GRIANNLGEL ALLQGQYQAA REHFEQALAQ MSSRVYHIDA YLNYGLAWHV EGMFEQAETA YRQALDLVES VERQEIAALV WFRLGQVAAA RNDHHQAEQH YLQAIELIEA MRAPILAESL QISLMGRWQQ VYEGAVAAYL AQSNVEAAFV MAEKARARAL NDLLARNGQT NQAIGTIPSL SELQQSLAQG SLMLDYMTIG AVGPEASLLA ALPASAKALR SLLIQPAATW LFAITAEQAQ AFNCQIDPNI LLATSPFQCD GRRFLRPAIL QRLQQRLLLP AQAYLQQAQQ VIIVPHGALH HVPWNALLLG ELQLDLPSTT IPSAASYLQL SQRPPSQAPE ACGALSYAGG VEPALVHTHA EAEAAVQALG GQHYPLPVPN IQQALGNYRI VHIACHGVFV LDQPLASWLQ FGPEQTVSAL EILTTWQLAA DLVVLSACQS GVSEIVRGDE PFGLVRAFLA VGARAVLVTL WPVDDVASAV LMKLFYQALQ SGAAPAEALR QAVQQIRSMP QTQVAMPLAP SQQTEYPFAD PHYWAGYQLI GVGSSISTVK PATSAA
|
| |