Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3508 |
Symbol | |
ID | 5735369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4417438 |
End bp | 4418718 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280655 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546272 |
Protein GI | 159900025 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCTG ATGCTGCCTA TCGCCAATTT CATCGTGATG CGGCAGAGCG CTTGCGTCAA CTCTACGCTG AACAGACAGG GCGGCCTGAA GGTTTGGATG GAATTATGGC ATGGCATTTA GCATTGGCCT GTGAATGGCC CGCCGCAATT ACGGCTGCGC TTGATCAAGC AGAAACGGTG ATTGCTCAAC ATAATATTGC TGATGCGCGA GTTTGGTGTG AACGGGCCTT GGAATATCTG AACCGTTTGG CCGAGGAGCA ACGCGCTAAT TTTGCGATTC GGGCCTACAG CTTGGCTTGG ACGGTGCTGG ATTGGGCGGG CAATCACGAT GATGCCTTGT TTTACGCTCA GCAATTGTTG ACCTTAGCCC GCGCCGCGCA ATCGGTGCAA ATTGAGGGTG GTTCGTTGGT GGCCTTGGGG CGTTCGCAAC GAGCAATCCG CGATTATAGC AGCGCCGAGC AAACATTAAA ACGGGCGCTC GATTTGGCTC GAACTACCAA CAATGTGCCG CTTGAGGCCG AAGCTTTGTT GCATCTTGGT AAAACTGAGC AACTCCAAGG CCACCATACC GCAGCAATTC ACCTCTATAA CGCCGCACAA GCACGCGGAG CGGTGATTGA CGATCGGCTA ACAATCGCTA AAATTATCAC AAGCATTGGT GATGTGTATC GGTTGATTGG TTCCGGCCAA CAAGCAGCCG ATTACTACTC ACGCGCCTTA GAAATCGAGG AACATCTGCC TGGCACGTTG GGCGTGGCGA TTGTTCATGA AAAATTGGCG CTTTCGTATC TCGAATTAAA CGATTTGGAT CAAGCGCTGC GTTGCCAAGC TGAAAGCCTG ATGCTGCGCG AAAAATTAAA CGATGCGGTT GGCACCGCCC GCGCCTACAC CGTGCTTGGG GTGATTCACC ATGCCCGCGG CGATTATCAA ACGGCGATTG AGTCGTTGCT CAAAGCCTTG CAATATGAGG ATCGGCGCAC GCCTGGAGTT GGCCAGATTA TGCTGCACAA TCGCTTGGGC GATGCCTATC GGGCGCAAGC TGATTATGGC TCGGCCAGCA CCAACTACAA TATTGCCCTG ACTAATGCCC AACAATTGGG CGATACAGTC GGGATTGCCT TGGCCAGCGA GCGTTTGGGC GATTTGGCCT ATGAGTGGAA TGATCGCCAT GATGCGGCCA AGCATTGGCA TACGGCCTTT TTATTGCGCC AAGGTCTTGG CCATTACGAT GAACAAAAAC GCTTGCGTGA ACGCTTGCGA ACTATAGGAA TACAGGTTTA A
|
Protein sequence | MIADAAYRQF HRDAAERLRQ LYAEQTGRPE GLDGIMAWHL ALACEWPAAI TAALDQAETV IAQHNIADAR VWCERALEYL NRLAEEQRAN FAIRAYSLAW TVLDWAGNHD DALFYAQQLL TLARAAQSVQ IEGGSLVALG RSQRAIRDYS SAEQTLKRAL DLARTTNNVP LEAEALLHLG KTEQLQGHHT AAIHLYNAAQ ARGAVIDDRL TIAKIITSIG DVYRLIGSGQ QAADYYSRAL EIEEHLPGTL GVAIVHEKLA LSYLELNDLD QALRCQAESL MLREKLNDAV GTARAYTVLG VIHHARGDYQ TAIESLLKAL QYEDRRTPGV GQIMLHNRLG DAYRAQADYG SASTNYNIAL TNAQQLGDTV GIALASERLG DLAYEWNDRH DAAKHWHTAF LLRQGLGHYD EQKRLRERLR TIGIQV
|
| |