Gene Information |
Locus tag | Haur_3418 |
Symbol | |
ID | 5735279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4305180 |
End bp | 4308101 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280565 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546182 |
Protein GI | 159899935 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
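The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4305180
END_BP = 4308101
GENE_LENGTH_BP = 2922
PROTEIN_LENGTH_AA = 973


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the terminal stop codon.

    Assumes the reported gene length includes the stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 4308101 - 4305180 + 1 = 2922 bp, and 2922 / 3 - 1 = 973 aa.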
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTT ATCCGACCTA TGTTGAACGC CCACGTTTAC TCACTTTGAC CCAACATGCG CCACGGGTAG TTTTAATCAT TGGGTTGCCA AGCAGCGGCA AATCCACAGT GTTAGAGGCA CTCAAGCATT CCATCAACGC CCCGTCTGCC TTGATTCGGC TTGAAGAGTC AGCGCATGAT CCCCTAGCGC TGCTTCAGGC TATTTGCCAA CAACTTGGCT TACCTAGCGA TCATGCACAT GAGCAGCTTT TAAGGTTACG CCAACCAAGC TGGATTTTGA TCGACGATTT GGATCGGATC TTGGATGCGT GGGGGACGTT CTTGCCTGAG CATGCTTGGC TGAGCGAATT ATTGAAAAAC CCAGCCCTGC ATATTGCTGC TACTTCGTTA CATCTGCCCA ATTTGCCCAT GATTCAACAA TCGATTATTC GTCGGCAATT GCGCATAGTG GGTATGCAGG AACTGGCCTT CACCAGCGCA GAATTAATCG AGCTGGCGCA ACAACGGCAA CTCCACCTAA CCCACGCTCA AGCTGAGCAA TTAATCGCCT TGACCGATGG TTGGGTTGCT CCTTGTATTT TTGCGCTCAA TCGTTCTGGT GTGCTCAATC AATCGGCGGT AGAGTGGGTG ATGCAAGATG TGATGGCGAG TTTGCCAGCG CCGTTGCATC AGATGCTTGA AGCTTGGCTT TGGCTTGCCC CCGATTCGAC CGAATTGCTC CAAGCGTTAC TGCCACACTA TGATCTGCAT CGTCTGATGA TGAGTTGGGA ATATGCGGGG TTGTTGGCGC AGTCGAGTCT TCAGTTGCAG CGTTTGCCCG AGGTCGGTCT GCGCCAGCGT TCTGATCCAC AGGCCAGCTA TCTGAGCCCA TGGTTTGAGC AGGCTGGCGA TTGGTATCTG GCGCAACAAC AATTATTGCC ATTTGTACAA AAAATGCAGC GCTTGCAACG TTGGGATGTG ATTAACCATA GCCTAACGCG CAATTTGGCA TTGATTGATC GTCAGCAGTT TTTTGCTGAG ATTGTCAATG CCTTGAATGC GGTTCCAGTT CAGCAGCTTA GCTCAAAAGT AGGTTGGTTG TTGGTGCGCA GCTACCATCT TGGGTTACGG AATGGTGAGA AAGCCTTGCT TTTGGTTAAT CAGCTGCTAG GCATTCATCA GCAGCCCGAT GATCAGCGCA TGTTGCGCTT GCTCAAGGCC GATATGTTGC GAGCACAGGG CGCAATTGTC GAGGCTGATG GTTTAATTCA GCCCTATCTG GATGACCCTG ATTTATCGCC AGCCGATCAG GCGCGGGTGC TCCGAACGCA TGCAAGTTAT CTTGTATCGT TGGGTCAAGA TGATAATGCG ATTACCCATT TTGAGCGGGC ATATGGCTAT ATTCAACATT CTGGCATTAA TCGTTTGCTC GGGTTGATTC TCGGTGATTA TGCGAATGCG GCGGCACGCG CCGGACGCTA CTCGTTGGCC GATCGACTGT TACGCCAAGC GACGTTGCTG TGGAATGAAT TAAATTCGCC TGCGGGTTTA TCGCAGACGA TGAATCTGCG GGCGGTGGTG GCACTGCATC GTGGCCAAAT TCGTGATGCA GCCCGCTACG CTCAAGAAGC CCTCGATAAT GCCTTGATTT CCGATAATCG GGCTGCCAAT GGTGCCTTGA TCACGTTGGG CGATGTAGCG TTAGCTGATC GTGATTGGAC AAGTGCGATT GCGCGTTATC GTTCGGCGCG TGAGCAATTA CGCAATAGTA ATATGGTCGA TTATCATATT TTGACCTATG CCTTGGCCTT GGAATCGCAA 
GCTACGCGCC ATAGTTCGAT CGAGCAGTTG CAACGCTTAT TGATTGAGAT CGATACTACG ACGGCCCAAA ATCCGCTTGA TCAGGCTTGG TTGGCAGTAG CTCGTAGTGC TGCTAACTTG ACATTGAAGC TTGATGGCAC GATTCCGTTA TTGCAGCGAG CGTTGGCTGG GCTTGATAGC GAAGGTGCGT TGGCCAAAGG TTTATTGTAT CTCTTGCTCA GCGAAGCTTT TTGGCAGCGT GATCAATTTG GTGAGGCTCG CCAAGCGTGG GAAGCCCTTG ATAAGTTGAT TATTGATGGG CGCAGTGGCT TGCCCGTGCT ATTGTCGGCG CTGACCGTGC ATCTCCCTGA ATTGGTTCAT GAGGCCTATA CCCAATGGAA TTCACCATTT GCGGCACGGG TGCTGCACCA TAGTTTGCCA ATGCCGCAAG CACCAAAATT GATTATTCGC TTAATGGGGC AGGTGCGTGT CGAGATGCAC GGCAAGCCTG TCAAAATTCC TAAACAAGGT ATTTTATTAA TTGCCCTGTT GCTTATGAAC CCTAGCGGGA TGACTGCTGA TGAATTACGG GCAAAAATTT GGGGATTTGA TAGTAGCAAT GATGGCTGGC GTAAAATGTT GCAACGTACT CGCAAAGAAT TGCCCGATTG CGTTATTTCT GATGGTTCGA TTTATCGTTT GTGTTTCCCA TTGTCTGAGA TCGACGCTGA TATTTTGGTG ATCAATCAGA CTCCATTACA AGGCTCTGAT GCGACGCTTG ATCGTCTGCA AATAGCCGCC GATTATGCAA GCCAAGTGTT TTTATCGGGC CATGAGGCTC CTTGGATTGA ATGGGAACGT AAGCAGTTAT CCAAACGTGG AGCCGAAATT TGGATTTCAA TCGGCATTCA ATGCTATGAT GCCCTGCGCT TTGATCAGGC TCAAAAGGCC TTTGCTCAAG CATTGAGACT AGATATTTCG AATGGGCGTG CGGTTTCGCA GGCAATGAAC TTGGAGATTA ACCAAGGTCG TCGGCTTGAG GCCTTGGCAA TCTATGACCG TTACCGTGAG GCCTTGTTTG AAGAATATGG GCTAGACCCT TCGGCAGAGC TGCAAGCTTT GCAAAAACGT GCTTTGGATT AA
|
Protein sequence | MSFYPTYVER PRLLTLTQHA PRVVLIIGLP SSGKSTVLEA LKHSINAPSA LIRLEESAHD PLALLQAICQ QLGLPSDHAH EQLLRLRQPS WILIDDLDRI LDAWGTFLPE HAWLSELLKN PALHIAATSL HLPNLPMIQQ SIIRRQLRIV GMQELAFTSA ELIELAQQRQ LHLTHAQAEQ LIALTDGWVA PCIFALNRSG VLNQSAVEWV MQDVMASLPA PLHQMLEAWL WLAPDSTELL QALLPHYDLH RLMMSWEYAG LLAQSSLQLQ RLPEVGLRQR SDPQASYLSP WFEQAGDWYL AQQQLLPFVQ KMQRLQRWDV INHSLTRNLA LIDRQQFFAE IVNALNAVPV QQLSSKVGWL LVRSYHLGLR NGEKALLLVN QLLGIHQQPD DQRMLRLLKA DMLRAQGAIV EADGLIQPYL DDPDLSPADQ ARVLRTHASY LVSLGQDDNA ITHFERAYGY IQHSGINRLL GLILGDYANA AARAGRYSLA DRLLRQATLL WNELNSPAGL SQTMNLRAVV ALHRGQIRDA ARYAQEALDN ALISDNRAAN GALITLGDVA LADRDWTSAI ARYRSAREQL RNSNMVDYHI LTYALALESQ ATRHSSIEQL QRLLIEIDTT TAQNPLDQAW LAVARSAANL TLKLDGTIPL LQRALAGLDS EGALAKGLLY LLLSEAFWQR DQFGEARQAW EALDKLIIDG RSGLPVLLSA LTVHLPELVH EAYTQWNSPF AARVLHHSLP MPQAPKLIIR LMGQVRVEMH GKPVKIPKQG ILLIALLLMN PSGMTADELR AKIWGFDSSN DGWRKMLQRT RKELPDCVIS DGSIYRLCFP LSEIDADILV INQTPLQGSD ATLDRLQIAA DYASQVFLSG HEAPWIEWER KQLSKRGAEI WISIGIQCYD ALRFDQAQKA FAQALRLDIS NGRAVSQAMN LEINQGRRLE ALAIYDRYRE ALFEEYGLDP SAELQALQKR ALD
|
| |
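The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts). Although the gene is on the minus strand, the gene sequence as listed begins with ATG and ends with TAA, so the sketch assumes it is already given in coding orientation and does not reverse-complement it. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Compact encoding of the codon table, bases ordered T, C, A, G
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string up to the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing characters other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in the record.
    print(translate("ATGAGTTTTT ATCCGACCTA TGTTGAACGC"))  # MSFYPTYVER
    print(translate("GCTTTGGATT AA"))                     # ALD
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence; translating the full gene sequence the same way would be expected to yield all 973 residues.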