Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2257 |
Symbol | |
ID | 5734144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2885853 |
End bp | 2888339 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279398 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001545025 |
Protein GI | 159898778 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCC TGCCTAATGC TGAAACGCTG CAACAAGCCG AAGCCTTGCT CGCTCAAATT CCACTTGATC GCATTCCAGC CCATCAACGC CCACGCGCTG ATTGGATGCT TGACGATCAG TTTCCGATCA GCAGTTTTGT TGGGCGCGAA GATTTGTTAA AGCAATTAGC GGCGGCCATG GCCTCAACCA CCCCAACTAT GATTGTGCCA ACGCTGGCGA TTACTGGCAT GGGCGGCATT GGCAAAACCA GCCTTGCGCT CGAATTTGCC TATCGCTATG GCCATTATTT TGCTGGTGGC GTATATTGGA TCAACGCCGA CTATACGCCA ATCGCCACCA CGGCTGCAAC GATCTTGCCC TCGGTTGATC GATTGTGGCA AAAGTTATTT CCTCAGCGTG ATAGCAGCCA AATCAGCCCT GAACAGCGGC TTAACGAGAT TAAAAGCTTT TTCAATAGCC CAATTCCGCG CCTGTTGATT TTTGATAATT GCGAGCAACA ATGGATTTTT GAAAGCTATC GGCCAGGCCC GCAAAGCGGC TGTCGTGTGC TGATGACTAG TCGCAACGCG GTTTGGTCAT CGAGCAATGT TCGTGCAATT GCGATTGATT TGCTGACCCC AGCTGAAAGT CGCCAGATGC TGCAAAAACT TGCTCCACGA GTTACCGATG CTGAGGCCGA TGATTTAGCC AAACTGGTGG GTTATTTGCC TTTAGCATTG CATGTAATGG GCGTGGCCTT GGGAACACTT GAACCATCAT TACCTGTCGC CAATTATTAT CAACGAGTGC AGCAAGCCTT AGTAGCCGAA CTCGAAACCA GCGCCAACAC CTTGCAAAAC CTCCATCGTT CGCCAACTAA CCATCAATGG AGCGTCGTTG CTACGGTGCG CGTGAGCTAT GGCCTGCTCA AACGCAGCTA TCAGGATGAA GCCAAACTAC GCCATCTGTT ATTGTTGCTG GCATGTTGTG CGCCAAATGC GCCAATTCCA ATCGATCTGT TGGTACGAGC AACCGAGCAG GATTCGGCGA CCGTTGGCGG ATGGTTGTAC GTACTACGCC AAAGCGGCTT TTTCGATCAC GACCCACCGC AGTTGCACCC GCTGATGCGC GAGGCGATTC GCATTATTGA AGCCGAGCAC TACCCCAACG CCGCCAACAT AATGACCGCT GCTTTGGTGG CTGAGGGCAA GGATGCTCAT GAAAAATGGC AGCGTGAGGC GATGTTGGCT CTAGTGCCTC ACCTAAGCGC TTGCCATGAA ACCGAAAAAA CGAAGCAAGG TTATACGGGC AATGTACTAG CAATTATCGC TCAAATTAAC CAACGACAAG GTAACTACCG CCAAGCAGAG CAATCTATGC GTGAAGTGTT AGAGTATGAA ATTGCCGTGT ATGGTTACGA GAAACAAGAA GTGATTACAA CTCAGCATAA TCTTGCTAAT ATATTATACG ATCAAGGTCT ATATATAGAA GCATTGAACC TCTTCCAAGA AATACTAACT ATCGAACAAC AAATATTAGA CGCAGAACAT CCCCATATTC TAGCTACTAA ACATGAACTC GCAAGGGTTT TACAGGCCCA AGGTGAATAT GCACAAGCCT TGGAGTTATA TCAAACCGTC CTTGCTAGTA ACCAACGAGT TTTAGGCACT GATCATCCTT CAACTCTCGC TTCTCAGCAT AATATTGCAA GTGTGTTTCT TGCCCAAGGA GATTACATCC AAGCACAGGA ACTCTACCAA ACAGTCTTTA CCATTCAACA ACGAGTTTTG GGCGAAAATC ATCCTTCTAC CCTTGCCGCT CAACATGAGC TTGCGAGGGT GTTAGTCGCT CAAGGCAACT ATGTAAAAGC ACAGGATATT TTCAAGGCAG TCCTTGTCAT TAATCAACGA AACTTAGGGA CGGATCATCC TCATACACTC ACCACCCAAC ATGAACTTGC GAGGGTATTC CTCATGCTAG GTGACTATGA TCAATCCTTG GATCTCTTTC AAACAGTTCT CGTTATAAAT CAAAGGGTTT TAGGGGCAGA GCATCCTTTG ACCCTCTCCA CTCAACATCA TCTAGCAAGC ATATTTCTCG CTCAAGGCAA CTATGTAAAA GCACAGGAAG TTTTTCAGGC AGTTCTTCCT ACCAAACAAC AGGTTTTAGG CGCGGAGCAT CCCGATACCC TCGCTACCCA GCATAATATA GCAAGTATAT TTTATAGCCA AGAAGCCTAC GACCAAGCCC TAGATATTTC CCAAACAGTC CTCAACATTG AAAAACAAAC TTTAGGAGAT GATCATCCTG ATACTCTCAT AACTCAATCC AATATTGCTG TATGCATGGC CCGACAGGGC CAATATTACG AGGCTGTAGC GTTGTTCTAT GAGATTATCC CCAAGCAAAT TCGCTGTTAT GGAGGAACTA CTCATCCCAA GGTGCAGGCG AGCATTGAAA TTCGTGATGC AATTGTGGCG GCTTTTTGGC AAAAGGAGCA AGGCTAG
|
Protein sequence | MTTLPNAETL QQAEALLAQI PLDRIPAHQR PRADWMLDDQ FPISSFVGRE DLLKQLAAAM ASTTPTMIVP TLAITGMGGI GKTSLALEFA YRYGHYFAGG VYWINADYTP IATTAATILP SVDRLWQKLF PQRDSSQISP EQRLNEIKSF FNSPIPRLLI FDNCEQQWIF ESYRPGPQSG CRVLMTSRNA VWSSSNVRAI AIDLLTPAES RQMLQKLAPR VTDAEADDLA KLVGYLPLAL HVMGVALGTL EPSLPVANYY QRVQQALVAE LETSANTLQN LHRSPTNHQW SVVATVRVSY GLLKRSYQDE AKLRHLLLLL ACCAPNAPIP IDLLVRATEQ DSATVGGWLY VLRQSGFFDH DPPQLHPLMR EAIRIIEAEH YPNAANIMTA ALVAEGKDAH EKWQREAMLA LVPHLSACHE TEKTKQGYTG NVLAIIAQIN QRQGNYRQAE QSMREVLEYE IAVYGYEKQE VITTQHNLAN ILYDQGLYIE ALNLFQEILT IEQQILDAEH PHILATKHEL ARVLQAQGEY AQALELYQTV LASNQRVLGT DHPSTLASQH NIASVFLAQG DYIQAQELYQ TVFTIQQRVL GENHPSTLAA QHELARVLVA QGNYVKAQDI FKAVLVINQR NLGTDHPHTL TTQHELARVF LMLGDYDQSL DLFQTVLVIN QRVLGAEHPL TLSTQHHLAS IFLAQGNYVK AQEVFQAVLP TKQQVLGAEH PDTLATQHNI ASIFYSQEAY DQALDISQTV LNIEKQTLGD DHPDTLITQS NIAVCMARQG QYYEAVALFY EIIPKQIRCY GGTTHPKVQA SIEIRDAIVA AFWQKEQG
|
| |