Gene Information |
Locus tag | Haur_2373 |
Symbol | |
ID | 5734254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3024060 |
End bp | 3026816 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279514 |
Product | tetratricopeptide TPR_3 |
Protein accession | YP_001545141 |
Protein GI | 159898894 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
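The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is an illustration rather than part of the record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 3024060
end_bp = 3026816
gene_length_bp = 2757
protein_length_aa = 918

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the table.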
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCTC CTGAGCGAAT GCTTGAGCGG CAGCAACGCT ATCAGGTAGC TGACGACCAT GAGTTTGCAT GGCGACAATA TTGTCTTGGT TTGGCCAACC TTCGGCTTCA GCGGCTTGCG GCGTTGCAGT TGGCCCTGAG CCAAGCCCGC CAAGCATTTG GGCTGAGCCA TGATCCGCTT GGCCTTTTGT ATTGTGAGGG TTTGGCTTTA GCTGGGGCTT GGCTCGCTGG AACAAATGTC GAGAATTTGC AGCGACTGGA AGGCTTGGCT CACCGCTTTC GTGCGCTTGG CTATCCACTT GAGGCAACTC GGACGGATCT GCTGCGGGTT TTGCATTATT ATAGTCTTGG TCAGGTTGAG CGAGGGTTGA GCTTGATTCA GCCGCTTGAA GCCTCGATTG ATCAGTTGGG CAGCAGATAC GACCAAGCCT ATTGGCTATG GATTAGTGGG ATGACCTATA GTCATAACCA TGATTTTAAC CAAGCTGGTC GTCTGCTTGA TCAGGCATTT GTGCTTTTTA GCAGCTTGCA CATGCAACCA GAGTGTGCCC GTTGCTTGTT TGATCGTTCA TGGTTGTGGC AGCGCCAAGA GCGCTATGCT GCGGCCTTTA ACGACCTCGC GGCGGCTCAG ACACTTGCCG ATGATCTGAA TTTGGTTTAT TTGATTGCCG CATGTGCCAA AAATACAGGC TTGGCGTTAA GTCGTTTGGG GCGTTATGCT GCCGCGTTGG ATTGGACACT AGATGCCCGT CGGCGCTATG GGCAGCTGGC GCGGCACGAT TTTATCAATG GTTGCGAGCT GAATCTCGCC AATATTCTCT ATTATGCTGG CTTGTATGAG CTTGCGCTTA AAATGTTCGA ACGGCTTGAG GGGAATTATC GTCGGCTAAA TTTGCTGCGG ATGGCAGCTG GGAGTGTCCG CAATCAGGCA CTTGTATTGC GTGCTCAACA TCAAGCACAG GCTGCCTATG ATCTATTGCG TTCGATTGAA GCGCCGATTT TGCAGGGTAC TAATCAATTA GAACAAGCCG AATATTGGCA AGCGCTGGCT TTTGCACTTG CTGAGCAGGG TGATATTGAG CAAGCAACCA CCATCTTCGG CCACGCCGAA AGCCTTTTTA AGCAACTTGA TAATCAACCT GCTGCTGCAA AATGCCGTTT AGAGTTAGCT TGGCTGGCGC TTAAACAACA TCAATCAACA GATGATTCTG ATAATGCCCA GATACACACA ATTAAACACC AGCTTCAGCA AACTTTGGGC TATTTGGATG ATCGCCCATT CTATCGCTGG CGCATTTTGT ATGGGCTAGG TTTGTGCGAG GCGGCCTTTG GTAAAACCGC AGCAGCCCTT GATTACTATA CCCAAGTTAG TCTTATTATC ACCAATTTGC GCCTCGAATT TTTCGATGAA CATGCCTCAA GTGCGATCTT TGTGCAGGCC CACGATCTTT TTACGGCGGC GCTTGATTTG ACTCTCCAAG CGAATAATCC GCTGGCCTTA TTGCAATTGA GCGAACAACA ACGCGCCTTG GTATTACGCC AACATCTGGA TTCAGGGCGC TCCAATCGTC GGATTCCGCG GGCGTTGCAG GCACAGTTGT GGCGTGGTTT GCGCACGGAT AGTCTGGGTT TCTTGGATGA TCTGCCGCCT GTTGGCAAAC TTGAGATTCC AGAGATTCAA TTTGACATCA CTGAAGCCTT TGATCAGCTT GATATTGCCC AGCTTCGGCA GCATTTAACG GCGGTATATC AGGCGCGTTG GGTGGTGTTG GCCTATATTT GTCATGGCTC ACGGTTAATT 
CGCTTAACCC TCACCCCAAC CATGCTGCAA GCTGATGTGA TTGTGCTTGA TGCTGGCTTG AAACGCTTGA TCGAACGAGC AAGTTTGGCA CGATATCGCT GGTTTACCTA TAGTGGTCAG CAAGCAAGCG ATCAGCCGAT TTGGCCTGAG CTTGAGTCGC TTGCGTCTCA CTTATTGCCC CCAATTGTTT CACAACCCCT TGATTACTTG CTGATTGTGC CCGCTGAGCC GTTACATGGA ATTGCATGGG GCACGTTACG ACTTAACCAA AAATGGTTGG CTCAGCATAC AGCGATTAAT ATTCTGCCTA ATCTTAGCCA CATTCAACCA ACCGCCATGG TGGTCAAACC ATATAGTCAG GGGGTTATGA TTGGGTGTAG CGAGTTTCGT GATGATTTGC CAGCGTTGCC AAATGTTGAA CCTGAAATAG CCCAACTTGA AGCAATTTAT GCTGATCAGC CTACTCTGAC CCTCCAGAAT GCGGCTGTAA CCAAAGCCCG AATCTTCGAG CTATCCAACG CCGGGAGCTT GCTAGGCTGT CGATTTTTGC ATATTGCCAG TCATGCCCAG CTACGCTCTG GCGCAGGCCA AGCTGCCTAT ATTCAATTGT GGGATGAGCG TTTAAGCTTC GATCAGATTA TTGATCTGCA ACTCCAAGGA ACATGGGTTA TTTTATCGGT CTGTGATGGC TCGGCGAGCG ATGTTTTGGC GGGCGAAGAG GTGTTGAGCT TGAGTCGGGC TTTTTTGGCA GCAGGAGCTA CGGCGGTAAT TGCCAACCTT TGGAAGCAAG CAGATGATGC TTCGCCTCGC TTGATGGCCC GCTTGCATAG CTTGCTCCAT GCTGGTAGCG ACCCAGTACG AGCACTATGT TTGCTCCAAA ATGATTGGCT TATGCACTAT GATAACGCTT CTCCCTTAGT TTGGGGCGGG CTTCAAGTTA TTAGTAGTTT GGCCTAG
|
Protein sequence | MHAPERMLER QQRYQVADDH EFAWRQYCLG LANLRLQRLA ALQLALSQAR QAFGLSHDPL GLLYCEGLAL AGAWLAGTNV ENLQRLEGLA HRFRALGYPL EATRTDLLRV LHYYSLGQVE RGLSLIQPLE ASIDQLGSRY DQAYWLWISG MTYSHNHDFN QAGRLLDQAF VLFSSLHMQP ECARCLFDRS WLWQRQERYA AAFNDLAAAQ TLADDLNLVY LIAACAKNTG LALSRLGRYA AALDWTLDAR RRYGQLARHD FINGCELNLA NILYYAGLYE LALKMFERLE GNYRRLNLLR MAAGSVRNQA LVLRAQHQAQ AAYDLLRSIE APILQGTNQL EQAEYWQALA FALAEQGDIE QATTIFGHAE SLFKQLDNQP AAAKCRLELA WLALKQHQST DDSDNAQIHT IKHQLQQTLG YLDDRPFYRW RILYGLGLCE AAFGKTAAAL DYYTQVSLII TNLRLEFFDE HASSAIFVQA HDLFTAALDL TLQANNPLAL LQLSEQQRAL VLRQHLDSGR SNRRIPRALQ AQLWRGLRTD SLGFLDDLPP VGKLEIPEIQ FDITEAFDQL DIAQLRQHLT AVYQARWVVL AYICHGSRLI RLTLTPTMLQ ADVIVLDAGL KRLIERASLA RYRWFTYSGQ QASDQPIWPE LESLASHLLP PIVSQPLDYL LIVPAEPLHG IAWGTLRLNQ KWLAQHTAIN ILPNLSHIQP TAMVVKPYSQ GVMIGCSEFR DDLPALPNVE PEIAQLEAIY ADQPTLTLQN AAVTKARIFE LSNAGSLLGC RFLHIASHAQ LRSGAGQAAY IQLWDERLSF DQIIDLQLQG TWVILSVCDG SASDVLAGEE VLSLSRAFLA AGATAVIANL WKQADDASPR LMARLHSLLH AGSDPVRALC LLQNDWLMHY DNASPLVWGG LQVISSLA
|
| |
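As a quick consistency check between the two sequences, the first 30 bases of the gene sequence can be translated by hand and compared with the start of the protein sequence. The sketch below is a minimal illustration, not the database's own method. The codon dictionary is deliberately partial: it holds only the codons that occur in this prefix, and their assignments are the same in translation table 11 as in the standard genetic code. The function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons present in the first 30 bases.
# These assignments are shared by the standard code and translation table 11.
CODON_TO_AA = {
    "ATG": "M", "CAT": "H", "GCT": "A", "CCT": "P",
    "GAG": "E", "CGA": "R", "CTT": "L", "CGG": "R",
}

def translate_prefix(dna: str) -> str:
    """Translate a short DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    # A codon missing from the partial table raises KeyError on purpose.
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First three 10-base groups of the gene sequence, as printed in the record.
gene_prefix = "ATGCATGCTC CTGAGCGAAT GCTTGAGCGG"
# First 10-residue group of the protein sequence, as printed in the record.
protein_prefix = "MHAPERMLER"

translated = translate_prefix(gene_prefix)
print(translated == protein_prefix)
```

Because the record lists the gene on the minus strand yet the sequence begins with ATG and ends with TAG, the gene sequence appears to be given in coding orientation, so no reverse complement is applied here.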