Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4676 |
Symbol | |
ID | 5736523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5973265 |
End bp | 5974830 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281840 |
Product | CHAD domain-containing protein |
Protein accession | YP_001547435 |
Protein GI | 159901188 |
COG category | [S] Function unknown |
COG ID | [COG3025] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAAG AAGCCAAATA CCGCGCAACG CAACCGATTC GCCCCAAACA ACTTGAAGCC ATTAATATCG CGCCGTTTAG CCTTGGCGAG CGGGTAACCG TCGAAATTCG CGATAGCATT TTGGATACCG CCAGCCGCCA ACTCGATCAG CAGCGCTACA CTCTACGTAT CCGCCGGATT GACCAACAAT TATGGCTAAC CTTGAAACTG CCGGGCAAAG TTAAAGGTGC TGTCCATCAA CGCCAAGAAT TTGAGCATGA AATTACGCCG AATGTGCGCG ATCATCCTGA ATTGTGGCCC AGCGATATTC GCGAGCCACT GCAGGCAGTA ATTGGCGACG AGCCATTATT GCCAATGCTG ACCGTGCGCA ATCGACGACG GCTCTGGCGG ATCGAGCGCG ATGGCCAAGA GATCGCCGAA TTGGCACTTG ATCGTGGCTC TTTGGTCAGT GGCGAACGTT CGTTGCCGTT CCACGAGATT GAAATCGAAC TCAAAGGCGT AGGCACTGCT GCTGATCTCG CAGTGTTGGC CTCGATTTAC ACCACGCAAC TGCCGCTGGA GCCAGAAACC CGCTCCAAGA GCCAGCGCGG AATGCTGCTT TTGAACCATG CCGAGCATCT CGAACAACTC AAACAAGCGG TTGATCGCAC ACCGATGGAG TCAACTAACA ATTTGGCCGA AGCAGGGCGC TCAATTTTGG CCAAACATGT GCTGAAATTA CACAAAGCTT GGCCAATCGC GGTAGTTGGC GATGATTCCG AGGGTGTGCA TCAGATGCGC GTGGCAACCC GCCGCTTGCG CACAATTTTG GCGATTTTGG GTGAAACCCT CTACGAACCA GAAATTGTCG CCAAATTGCG GCGAGGTTTG CGCCAATGGG CCAGCGTTTT GGGCGCGGTG CGCGATGCCG ATGTGTTTTT GGGTAAACTT GATGATCATC GCGCCGACTT GCCTGAGCAA GAACAAGCAG GCTATGCACC CTTGATCGAG TCGATCAGCC AGCAGCGAGA TGCGGCACGG GTTGAATTAT TGAGCTTTTT GGAAAGCCGC AAAGCCAGTA AATTTGCCCA GCGTTTGACC GACTTTGTAC TGACTGCCGG CGCTGGAGTG CGCGAGGCTG ATACTAGCGA GGGCCGAGTC GAGCGTAGTT TGGTGCGACA TTGGGTTGGC AGCACGGTTT GGAGCCACTA CGAGCGCATT CGCGCCTACG ATGTGATCTT AAATGATGCA CCAGTGGAGA GTTTGCACCG CCTGCGGATT GAAATTAAGC ATCTGCGCTA TACGCTTGAG CTATTTAGCG CGGCCTTGGC CGAAGAACAC GAATCGCTGC TTGAGCAATT AGTCACCGCT CAAGATTATC TGGGCGATTT GCAAGATGCC GAGGTGGCTT TGGCGGTGGT TGAACAAAGT TTAGCCGAAA ACCCTGAAAA CGGTGCACTC TTGGCCTATC AAGCTGCCAA GCAGCAAGAG CATGATCAGC TTCAGTTGAA TGCACCCAAA ATACTGCGCT CGTTGTTTGA TCTACCCTTC CGACGACGTT TGGCCTCGAT TCTTTCAAAG TTGTAG
|
Protein sequence | MEQEAKYRAT QPIRPKQLEA INIAPFSLGE RVTVEIRDSI LDTASRQLDQ QRYTLRIRRI DQQLWLTLKL PGKVKGAVHQ RQEFEHEITP NVRDHPELWP SDIREPLQAV IGDEPLLPML TVRNRRRLWR IERDGQEIAE LALDRGSLVS GERSLPFHEI EIELKGVGTA ADLAVLASIY TTQLPLEPET RSKSQRGMLL LNHAEHLEQL KQAVDRTPME STNNLAEAGR SILAKHVLKL HKAWPIAVVG DDSEGVHQMR VATRRLRTIL AILGETLYEP EIVAKLRRGL RQWASVLGAV RDADVFLGKL DDHRADLPEQ EQAGYAPLIE SISQQRDAAR VELLSFLESR KASKFAQRLT DFVLTAGAGV READTSEGRV ERSLVRHWVG STVWSHYERI RAYDVILNDA PVESLHRLRI EIKHLRYTLE LFSAALAEEH ESLLEQLVTA QDYLGDLQDA EVALAVVEQS LAENPENGAL LAYQAAKQQE HDQLQLNAPK ILRSLFDLPF RRRLASILSK L
|
| |