Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0938 |
Symbol | |
ID | 5732824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1073141 |
End bp | 1074901 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278070 |
Product | hypothetical protein |
Protein accession | YP_001543714 |
Protein GI | 159897467 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000915423 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGGC AGCACTTTCG CTATCAACGC CTCGCACTAC TCAGCCTTGT GGCTTTGCTG GGAGTGTTAG TCACACCAAC CAAAGCCGTC GAAATCCCGC CAACAGGCGC ATCGGTGTAC CTCCAAACCA ATGGGCCTAC CAATACCTTG AATAACGGTG ATTGGTACAC CAATAGTCTG GCAGGTGCAG GTAATGGTTA TCATTACTTT ACGGTTGATA TTCCATGTGC TTGGCCGAGC ACTGAGCCAG TCCATATCGA TATCTTTAGC CCTGAAATGA ATAGTAATGC CCCCCTCAGC GACGAGATTC GTGGTGGGGT GTACGATAAT ACGCAATTTG AGTTCTATGC TGCTGGCACG CCGATTGTTG TTCCAGCAAC GCCTGGCCCA GGTGCGGCTG GTAGTTTGAT TCAACAAACC TTTGTGCCTG CTGGCACGCC TGAGGCATGG TTGCGCTTCT ACACAATTGC CGCACCTGTC ACTTGTGGTA CGTATGTTTT GCGTTCAGCC ACTTCCGGCA ACGACGAAAA TGGTTGGCGC TTGCGGGTTG GCCGCGATAA CGACGCTGAT CCCAATAACG CGCCACCAGC CAACACCGAT AATTTCGATG GTGTAGCTGG TACTGGCGAT GAAATTACCC TTGGGATGCG TCAGGCTTCG TTCCAACACG ACGCTGGTGC AGCAGATGTT GTCGCCACAT GTTTAACCTT GTATGAATAT GTTACTCCTG GTCAGCCCAG CGTTAGCTTT AATAATTTTG ATATCGATAA TGTCCGTCGG GTGCGTTATT ATGCGCCTGG TGATGCGAGC TACACGCCTA TGGGCAATAG CGGTGGCATC GTGGGCAGCC TTAGCAACGA CCAAATTTGG AATGGCACTG GCGCAACCTT AGCCACCCGC GTCGGCGATA CGATCAATAA CCCAGTTTCA GGCTGGTGGC GGATCGTAAC CTGTACCAGC AATCACAATC AGTTTATTCA AGAAGGCCAA ACTGGCACAC CAGCCTACTA CGAACAGCCA CCAACGCCAG TCATGGCCTT GAGCAAAACC GATGGCGTTA CCTTGGTCTT ACCAGGCGAT ACGCTGAATT ACACCATCGC CTTTACCAAT ACTTCCAACA GCACGGCCAC GCCAGGTAGT GCAACCAACG TTACCTTAAC CGACAATTTA CCACCCGACA CGACCTTTGT CAGTTGTGCG ATTAATCTGC CATTCACTGG TACATGTAAT CATGCTGCTG GTGTGGTGAC CTTCAATATT ACCCAAATAG TTCGCCCAGG CGAAGTTGGC ACACTCAACG TTCAGGTAAC CGTCAACGAT CCGATCACCA CGGTTCCGGT GGTCAACAAT GTTACCTTGA CCTTCAACGA TACCTTGAAC AATGTGTTCC AACCATTGAA CGCCAGCGAT AGCGATTTGG TCAATCCAAC TGCGGTAACT GTGGTTGGCT TCACGGCCTT GGTACGGGTC GATGATATTC AAGTGCGCTG GAGCACTAGC CAAGAATTGG AAACCCAAGG CTTCCATATC TATCGGAGCA CCAGTGATGA CCCAGCGACC GCCGTCCAAG TGACTGAGAA CTTGATTCCA GCCTTGGGTG CGCAAACCAA CTATCAATGG CTTGACACCA ACGCTGAGCC AAATGTGCAT TATTACTATT GGTTGGTTGA AGTCGATGCC AACAATAATT TGAGCATGAT CGGCCCAACC GATGCACAAA TCGAGCGCTA CAGCATTTTC ACTCCCTTCG TTATACGCTA A
|
Protein sequence | MGRQHFRYQR LALLSLVALL GVLVTPTKAV EIPPTGASVY LQTNGPTNTL NNGDWYTNSL AGAGNGYHYF TVDIPCAWPS TEPVHIDIFS PEMNSNAPLS DEIRGGVYDN TQFEFYAAGT PIVVPATPGP GAAGSLIQQT FVPAGTPEAW LRFYTIAAPV TCGTYVLRSA TSGNDENGWR LRVGRDNDAD PNNAPPANTD NFDGVAGTGD EITLGMRQAS FQHDAGAADV VATCLTLYEY VTPGQPSVSF NNFDIDNVRR VRYYAPGDAS YTPMGNSGGI VGSLSNDQIW NGTGATLATR VGDTINNPVS GWWRIVTCTS NHNQFIQEGQ TGTPAYYEQP PTPVMALSKT DGVTLVLPGD TLNYTIAFTN TSNSTATPGS ATNVTLTDNL PPDTTFVSCA INLPFTGTCN HAAGVVTFNI TQIVRPGEVG TLNVQVTVND PITTVPVVNN VTLTFNDTLN NVFQPLNASD SDLVNPTAVT VVGFTALVRV DDIQVRWSTS QELETQGFHI YRSTSDDPAT AVQVTENLIP ALGAQTNYQW LDTNAEPNVH YYYWLVEVDA NNNLSMIGPT DAQIERYSIF TPFVIR
|
| |