Gene Information |
Locus tag | Haur_1905 |
Symbol | |
ID | 5733794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2299446 |
End bp | 2300759 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279049 |
Product | hypothetical protein |
Protein accession | YP_001544676 |
Protein GI | 159898429 |
COG category | |
COG ID | |
TIGRFAM ID | |
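
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported 1314 bp) and that the protein length excludes the single terminal stop codon of an unspliced CDS. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2299446
END_BP = 2300759


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1314
print(protein_length(length_bp))  # 437
```

Both results match the Gene Length and Protein Length rows of the record.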
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0772428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGAG TAGTGTTGTG GATGCTAGCA TGTTTGTTTG TGCTGGGGTT TCAGGCTACG GCTCAGGCGG CTGAGCCAGC CAAAATCTCG CGTAACCAAG CAACAAGCAG CTTTGGCGAA AATGTGGTTT TTGAGCTTGA AGCTGAATCA AGTTCGCCAA TTCGCGAAGT GACGTTTTTA TATGCCTTGG GTGTGCAGCC AGGCGATGTG CCAGCTTACA CCGAGGCCGA GGCCAAATGG CAACCTGGTA GCTCGATTGA GGCCAGTTTT ACCCGCGATA CCAGCATTGA ATTTTTGCCA GTTGGCGTGA CGGTGCGCTA TAAATGGCAG CTGGTTGCCG AAGATGGCAC GATTACTGAA ACACCTGAGC AATCAGTCCA ATATCAAGAT ACCCGTTTCA ATTGGCAAGA AAAAAGCTCA CGTGGGATCA CTGTGCGCTG GTATGATGGC GATGAGCAAT GGGGACAAGA TTTGCTCGAT AGCGCACTTG GTGGGCTTGA TCGGCTTGAG CAGCGGATTG GCGGTTCGGT CGAAGATCCC ATGACGATCT CGATTTATAG CAATACCCGC GATATGCGCG GCGCTTTGCC ACCCAACTCA GCCGATTGGA TTGGCGGTCA AGCACGGCCT GACCTTGGCT TGATTATTGG GTCGATTGAT GCTGGCGATG ACGCTGAATT AGGTCGTTTA GTGCCGCATG AATTAAGCCA TTTGGTGCTG CATCAAGCAA CCAACAATAA TTATGGTGGT ATGCCAGTTT GGTTCGATGA AGGTTTGGCG GTTGCCAACC AAGATTCGCC CGACGCTGGC TTTAAGCAAA TGGTTGAGCG GGCTGCCGAA AATGGCGAGT TGATTCCGTT ACGTGCTTTG GCCTCGAATT TTCCTTCCGA CCCTGAAAAA GCCCTGCTTT CGTATGCCCA AAGCGAAAGT GTGGTGCGTT ACATCGAATC AACTTATGGC ATCGAGGCGA TTACCAAACT CGTCGCTCAA TTTAAAAGTG GCGTAACCGA TGATGTGGCG GTGCAAACTG TCTTGAATCG TAGCCTTGAT ACCTTGGATA GCGAATGGCG CAGCACCTTG CCTGAAGCGC AAGGCTCTGG CCCAGCCCAA ATCTTGCCCG ACGATACCGC TCCAGCTGAT CGATTTAGCG AACAACCACG ATCCTCAGCT CCTAGCAACC CAAGCGCACC CAATAGTCCA GCGGCAACCC CATCAGTGCC CTTGTGGATT TGGCTAGCAG GGATTGGGGG CTTGCTCTTG ATCGTTTTTG GTACGATTTG GATTATTCGC AGCAGTCGCC AACCACGCTA CTAA
|
Protein sequence | MRRVVLWMLA CLFVLGFQAT AQAAEPAKIS RNQATSSFGE NVVFELEAES SSPIREVTFL YALGVQPGDV PAYTEAEAKW QPGSSIEASF TRDTSIEFLP VGVTVRYKWQ LVAEDGTITE TPEQSVQYQD TRFNWQEKSS RGITVRWYDG DEQWGQDLLD SALGGLDRLE QRIGGSVEDP MTISIYSNTR DMRGALPPNS ADWIGGQARP DLGLIIGSID AGDDAELGRL VPHELSHLVL HQATNNNYGG MPVWFDEGLA VANQDSPDAG FKQMVERAAE NGELIPLRAL ASNFPSDPEK ALLSYAQSES VVRYIESTYG IEAITKLVAQ FKSGVTDDVA VQTVLNRSLD TLDSEWRSTL PEAQGSGPAQ ILPDDTAPAD RFSEQPRSSA PSNPSAPNSP AATPSVPLWI WLAGIGGLLL IVFGTIWIIR SSRQPRY
|
| |
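
The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the grouping, compute GC content, and sanity-check that a gene sequence looks like a complete CDS. The helper names are hypothetical, and the start-codon set is an assumption covering only the most common bacterial starts (translation table 11 permits others); the stop codons are the standard TAA, TAG, and TGA.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(
    dna: str,
    starts=("ATG", "GTG", "TTG"),   # assumed: common bacterial start codons only
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are a recognised start and stop codon, respectively."""
    dna = clean(dna)
    return len(dna) % 3 == 0 and dna[:3] in starts and dna[-3:] in stops


# Small illustrative inputs (not the full record):
print(len(clean("ATGCGACGAG TAGTGTTGTG")))     # 20
print(gc_percent("ATGC GCTA"))                 # 50.0
print(looks_like_complete_cds("ATG CGA TAA"))  # True
```

Pasting the full Gene sequence field into `gc_percent` and `looks_like_complete_cds` would allow a comparison against the GC content and Gene Length rows of the record.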