Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2022 |
Symbol | |
ID | 5733911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2512278 |
End bp | 2514971 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279166 |
Product | hypothetical protein |
Protein accession | YP_001544793 |
Protein GI | 159898546 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC AACAATACGT GCAGGATGCT GTTGGTTGGC AACTTACGCC ACGCAGTATG AATCTTCAAG GTGTTTGTTT GATCGACCCA CAGGTTGGTT GGGCGGTCGG CGATGGTGGG TTAATTGTTT CGACCAACAA TGGCGGTACG TCTTGGCAAC AGCATGAGGT GACCGAAAAA GATTTACGCG CTGTTACCTT TATTAATCGG CTGCATGGCT GGGCGGTTGG TCGCGATGGG ATTATCATCG CGACCAACGA TGCTGGTGTT TCATGGCAGC AGCAAGCCAG CGGTTCTGAG CATAATCTCT ATGGTGTCGC CTTTGCACCT AACGCAAGCC ACGGCTGGGC GGTTGGCGAC GATGGGATTA TCTTGACAAC CCAGAATGGC GGTGAGACAT GGCAAACTCA ATCAAGTGGC CTGCATCAGC AATTAGCTCG CTGTAGCACT GCCGATGGTG TCCATGGTTG GGCCGTTGGG CATGGTGGGG TTATTCTCGT AACCAACGAT GCTGGGCAGA CATGGCAGCA GCAAACAAGC CCAACCAGCA ACGATTTACG CAGCGTATTT GCAGTTAATC AGCAGGTCGC TTGGATCGTT GGGCGTGCAG GAACAGTGTT GATAACCAAC GATGCTGGCC AAACATGGAA TCCTGTCCAT ACTCCAAGCA ACAAGGATTT CTATGGGATT GTGGCTGATG CAAGCGAATG CGCAGCTTGG ATTAGTGGCG ATGATGGCAG TTTCTATCGC ACCCACGATG CTGGGCAGAC GTGGCAAGCT CAACAGAGCA AAACCAGCAA AGATCTTGTA GCCCTTGCCA GTAGTCGCGA TCTCCTGCAT GGCTGGGCGG TTGGTGATGA CGGAATTATT ATTTCAACGA ACGATCATGG CAATCACTGG CAAATTCAAT TAAATGGTGC TGATCAGCAG CTCATGGCGC TGACCTTTGA TCAACATGGT CAAACCGGCT GGGCGGTTGG CCACGCTGGT ACGATTATCC ATACAACTGA TTATGGCCGC AGTTGGCGAA CCCAAACCAG TGGCGTAGCC CAGCCACTTT GGGCCGTCTG TAGTAATCCG ACAGGGACGA TTGGCTGGGC GGTTGGTTCG CACGGCACAA TTCTCGCAAC CACCGATGCG GGAAAAAGTT GGCAGCAGCA ACGTTCGCCA TCCTCAGTTA ATTACTACGG AATGTGGTTT ATCAATCAGC AGGTTGGTTG GATTGTCGGT GCAGCAGGTA CGGTATTATC GACGCATAAT GCAGGACGGG ATTGGCGACA AAAATTATTA GGATTTGGTG TTGCAGACTA TTATGCGATA TCATTTTCTG CCGATGGACA AGCTGGCTGG ATTGTCGGCG CACATGGCAC AATTCTCGCG ACCAATAATG CTGGCCAAAC ATGGCACCAA CAGCCGAGTG CTTGCGATGC AGCCTTGCTC GCGATCCAGA TTGTTAATCA GCAAGAGGTT TGGATTTGCG GCAGTAATGG AACGCTGCTC CAAACAATCG ATGCCGGAAA CAGCTGGCAG CAGCATGCTT GCGATCCGCA ACAGACCTTT ACTGATCTTG TCCTGATTGA TCAACAAATC TACCTCATTA CCCAATCTGG CCAATGTTGG TCAAAATTGA CGACAGACCA GCATTGGCAA ACCCAAACCC AGCTTAGTCG CCAAGCGCTG CGCTGGATCA CTCACAATTC GCACGCCGCA ATCACGTGGA TTGTGGGTGA TGGCGGTACA GTTCTAACCA GTCCTGATCG TGGCACCACG TGGACTGCGC AAGTCAGTGC TACCGGAAAA AACCTGTATG GGATTGATTT CGTGCCCGAT GGTCAAATTG GTTGGATCGT TGGTGATGGT GGCTCGCTGT TCAGCAAACA GGCTAGCGCA ACCAGCCTGC GTGTCCATAG TAGCCCAATA ATTAAAAACC TCTATGCAAT TCATAGTCTT GGTGATGGAA CAAAGGCATG GGCAGTTGGG CGTGAGGGCA GCATTATCCA TACGTCGGAT GCTGGACATA GCTGGATTGT TCAACCAAAT CCCTGTAGCT CTGATCTTTT AGCTGTCTAT TTTGCTGATG CGCAGCGTGG CTGGGCCGTT GGTGATCATG GCACCATTCT CGCAACCACC GATGGCGGAG CTTCGTGGCA ACAAGCACCA TTCGCTACTG ACCAAATTTT ATGGGGGCTA GATTTTGCTG ATGCACAGCA TGGCTGGGCC GTTGGTGATC ATGGCACCAT TCTCGCAACC ACCGATGGCG GAGCTTCGTG GCAAATCCAA GCCAGTGGCT GTGACAAAAA CTTCGCAGCC ATTCATTGCC AGCCGACATC GACCCACAGC TGGATTGCGG GTGATCGCGG GACGATTTTG CATAGCCGCG ATGCAGGCGG TTCATGGCAA CTCCAGCCAA CTCCAACCCA AAAACACCTA GCGGCTGTGC ATTTTTTGCC TGATGGTCAA ACAGGCTGGG CCGTCGGTCG TAGCGACACA ATTCTTATGA CTCGTGATGC TGGCAACACG TGGCACCAGC AAAGCACCAC CACAGGCATG AGCCTCTGGG GAGTTCATGG ATTATCGTCC AGTCATTATT GGGTAATAGG CGATGAAGGA ACGATTTGTG CGACCAACGA TGCTGGCTTA AGCTGGCAGC TTCAATCGCG TGGGTTTGGA AAACACTATC CCCAACCAAG CTAG
|
Protein sequence | MTDQQYVQDA VGWQLTPRSM NLQGVCLIDP QVGWAVGDGG LIVSTNNGGT SWQQHEVTEK DLRAVTFINR LHGWAVGRDG IIIATNDAGV SWQQQASGSE HNLYGVAFAP NASHGWAVGD DGIILTTQNG GETWQTQSSG LHQQLARCST ADGVHGWAVG HGGVILVTND AGQTWQQQTS PTSNDLRSVF AVNQQVAWIV GRAGTVLITN DAGQTWNPVH TPSNKDFYGI VADASECAAW ISGDDGSFYR THDAGQTWQA QQSKTSKDLV ALASSRDLLH GWAVGDDGII ISTNDHGNHW QIQLNGADQQ LMALTFDQHG QTGWAVGHAG TIIHTTDYGR SWRTQTSGVA QPLWAVCSNP TGTIGWAVGS HGTILATTDA GKSWQQQRSP SSVNYYGMWF INQQVGWIVG AAGTVLSTHN AGRDWRQKLL GFGVADYYAI SFSADGQAGW IVGAHGTILA TNNAGQTWHQ QPSACDAALL AIQIVNQQEV WICGSNGTLL QTIDAGNSWQ QHACDPQQTF TDLVLIDQQI YLITQSGQCW SKLTTDQHWQ TQTQLSRQAL RWITHNSHAA ITWIVGDGGT VLTSPDRGTT WTAQVSATGK NLYGIDFVPD GQIGWIVGDG GSLFSKQASA TSLRVHSSPI IKNLYAIHSL GDGTKAWAVG REGSIIHTSD AGHSWIVQPN PCSSDLLAVY FADAQRGWAV GDHGTILATT DGGASWQQAP FATDQILWGL DFADAQHGWA VGDHGTILAT TDGGASWQIQ ASGCDKNFAA IHCQPTSTHS WIAGDRGTIL HSRDAGGSWQ LQPTPTQKHL AAVHFLPDGQ TGWAVGRSDT ILMTRDAGNT WHQQSTTTGM SLWGVHGLSS SHYWVIGDEG TICATNDAGL SWQLQSRGFG KHYPQPS
|
| |