Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5207 |
Symbol | |
ID | 5737165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 296919 |
End bp | 298934 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282371 |
Product | TPR repeat-containing protein |
Protein accession | YP_001547962 |
Protein GI | 159901716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.143534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTGC CAGCCCTCAC TGCCGTTCTG GTCGATGCGA TTCTTGCCCA ATGCCCGCAG TATGATCGCG TGAGTGTTAC CGTGGCGCTC GAATCGGTTT TTGCGGGCCA TCCCACGCTG CTGGGTGGCA ACTCCATCTC CATGCTATTT GGCCAGAACA ATGATTTTAC CAATGCCACA GTGACGATAG GCGACGTGCA TGCCGGAAAT CAGGTGCAGG TGACACTACC GCAGCCGATT GACCTCTTGC CCGCTGCGCT TGCGGCCCTG GCCTCGATTC CATTGACGGA TGTGCCAGCG CCCCGTTCCG ATCTTCCCCA AGCCTCACGG TTGCCCTTTG AATCGAGTCC CCACTTTGTT GGGCGCGAGA CGGAATTAAA AGCGCTCGCG CAGGCAATTG GCACAACCCA GCCAGCGGTC GTCATGCCAG CGGTGGCGAC GGGATTGGGC GGGATTGGCA AAACCAGCCT GGTGACCGAA TTTGCCTATC GCTATGGCTG GTATTTTCAT GGCGGGGTGT TTTGGCTGAA CTGTGCTGAC CCAAATCAGG TGGCCAGTCA GATTGCGGCC TGTGTGGTTG GCTTGAAGAT TGATACTACC GGATTGTCGC TTGATGAGCA GGTGCAGCGA GTTTTGCATG CTTGGCAATC CCCCATGCCG CGCTTGCTGA TTTTCGACAA TTGCGAGGAT CGGGCGATTC TGGATCAATG GAAGCCCACC GTGGGTGGCT GTCGGGTGCT GGTGACGACT CGATCCGATC AGTGGCCAAC GCTGACGCAG ATTCGGCTTG GGCTGCTCTC ACCTGCTGAA AGTCATTTGC TCTTGCAGCA ACTCTGTGCG CGGTTGACCG ATGCCGAGGC TGATGCGATT GCCGAGGATC TTGGGTATTT GCCACTCGCG TTACATCTGG CGGGTAGCTA TCTCGCCACC TATGATCATC ATACGGTTGA ACAGTACCGT AAGGATTTAA CGATTGCTCA CCGCTCGCTC AAGGGACGGG GAGCATTGCC CTCACTGACA CGGCATGAAC TCGATGTCGA AGCGACCTTT ATGCTCAGTT TTAACCAGCT TAATCCGACC AACGCGCTTG ATGCCTTGGC CTTGGGCATG CTTGATGGTG CGGCGTGGTG TGCACCAGGT GTGCCAATTC CGCGTGATTT GGTACTGGCG TTTGTTCCTG AGGGGGTTGA TGCCGATGAT GCCCTGCGGC GATTGCAACA GCTTGGACTT CTTGATGGGG CGGATGCGGT GGTACTCCAT CGCTTAATCG CCCACCATGT TCATGACCGA TTAGGAGACC GCCGCAATGC CGTGCGGGTC ATCGCACGGT GGGAGTCCCA GCTAGCGCAG CGGCATGGAC AGGCATATTT TGTGCTCCTA CGAGCGGCAC TGAGCCATCT GCGGCATCTG AGTACCGTGA TGCAGGGTTG GGGCGATCCA TGGGTGGCGA TCTGTGCCCG CCACATGGCC GAGTATGCGG ATGCGATGAA CGACGATGCC CTCGCCGAGG AATGGTATAG GCAGTGTTTG CAGGCAGAGC ACCGCGCGTT TGGAGCAGTG CATGGGCAGG TGGCAGCAAC GATGAATGAT CTTGCAGAAA TCTGTGTTCG TCAAGGGAAG AGGCAGGCGG CCCTTGCATA TGGGCAGCAG GCGTACCAGA TCAATCGCCA GCTGTTTGGG GATCAGCATG AAACAACCGC GTTTTATGCC TTGCGCGTCG GGTATTTCTT GCGGCAGCAC GGGTTGGAGC AGCAGGCACA GGCATGGTAT GCCCAAACAT TAACGATCCT GTCCCAGGAT GACCGCGCGG AGGAGCGGGA TGTCGGGTTG ATCCTTGAGG CAACGCTCGC CCTCGTAACC CTTTTGCAGC ACACTGGGCA AGCGGATACT GCCCGTCAGG TCTATGATCG CAGTATGCAG ATACTCCAGC CGTATGCGAT TACCGAGGAG ATGGATGCGT ACTGGGATGG ATGGCAAGCG CTCCAGGAAA CCATGCAATC GCATGGTACA CAATAA
|
Protein sequence | MELPALTAVL VDAILAQCPQ YDRVSVTVAL ESVFAGHPTL LGGNSISMLF GQNNDFTNAT VTIGDVHAGN QVQVTLPQPI DLLPAALAAL ASIPLTDVPA PRSDLPQASR LPFESSPHFV GRETELKALA QAIGTTQPAV VMPAVATGLG GIGKTSLVTE FAYRYGWYFH GGVFWLNCAD PNQVASQIAA CVVGLKIDTT GLSLDEQVQR VLHAWQSPMP RLLIFDNCED RAILDQWKPT VGGCRVLVTT RSDQWPTLTQ IRLGLLSPAE SHLLLQQLCA RLTDAEADAI AEDLGYLPLA LHLAGSYLAT YDHHTVEQYR KDLTIAHRSL KGRGALPSLT RHELDVEATF MLSFNQLNPT NALDALALGM LDGAAWCAPG VPIPRDLVLA FVPEGVDADD ALRRLQQLGL LDGADAVVLH RLIAHHVHDR LGDRRNAVRV IARWESQLAQ RHGQAYFVLL RAALSHLRHL STVMQGWGDP WVAICARHMA EYADAMNDDA LAEEWYRQCL QAEHRAFGAV HGQVAATMND LAEICVRQGK RQAALAYGQQ AYQINRQLFG DQHETTAFYA LRVGYFLRQH GLEQQAQAWY AQTLTILSQD DRAEERDVGL ILEATLALVT LLQHTGQADT ARQVYDRSMQ ILQPYAITEE MDAYWDGWQA LQETMQSHGT Q
|
| |