Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0831 |
Symbol | |
ID | 5732732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 941899 |
End bp | 943257 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277963 |
Product | hypothetical protein |
Protein accession | YP_001543607 |
Protein GI | 159897360 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGAG ACCATGCGAT TGTGATCGGG TCCAGCATCG GTGGATTATT GAGCGCCCGT GTGTTGAGCG AGGTGTATAC CAAAGTGACG ATTTTGGAGC GCGATACACT GCCCGATGGG GTGCAATTTC GCGCGGGCGT GCCGCAAGGT CGTCAATTAC ACTTACTACT GATGCGCGGC TTACTCGAAT TAGAAAAACT GTTCCCCAAA TTGCGCGAAG ATATGAAGGC GCTTGGCGGC AACGAGCTGG ATCTGATGAA CGATTGTCGC TATTTATTTG CAGCGGGTTG GGCCGAACCA CGACCCTCGG ATTATCGCAG CATCATCGCC ACCCGCCCCT TGTTCGATGC AGCGATTCTT GGGCGGATTC GCCAAATTCC CAATATTGAA ATTCAAACGC GCCAAGAAGT GACGGATTTG CTCTTTCACG GGCGGCAAGT TGTCGGGGTC AAGCTGCGCA ATCGTGATAC TGGTGAGCAG CATGAACTCA ACGCCAACTT GGTTGTCGAT GCAGCAGGCC GTCCATCCAA GCTGCCCGAA TGGCTGGAAC AGCATCAATT TGGGCGCGTC AAAGAGACCA TTTACAAATC GGATGTGGGC TATGCCACCA GAATGTACAC TGCGCCAGCC GATTGGCAAG GCTGGCCAGC GATCTTTGTG ATGCCGATGC CGCCGCATAT TCGCCATGGC GCGACCTTCC TTAAAGTTGA AAACGATCAA TGGCTCTTGA CAATTGGCGG AGTTGGCAGC CAACGCCCAC CAACCGATAG CGCCGAGTTT GAAGCTTGGT TGGCACGGAT GCGCACACCA TTGATTGCTG AAATTACCCA AAAATTTACG CCTGCCAGCC AAGTGTATGG TTATGCCAAA ACCGAAAATC GTTGGGTTCA TTACGAACGA CTACCAGTAA TGCCCGCTGG CCTGATTGCC ATTGCCGACA GCGTGTGTGC CTTGAATCCA ATTTATGGCC AAGGCATTAC CAATACTGCC ATGAGCGTTC AGGTTTTAGC CAAGCATCTG CAACGCGAGG GCATTGGCAG CGCCCCCTGG GCCTTGATCT TCCAAAAAGC CTTGGCTCGC CGCAACAAAT CGGGCTGGCT CACCACCATA AGCGAGGATT CACGGCTGGC TGATACGCCG ACCTATCGCC CAATTTTGCC AATTCGCGTG CTGCAATGGT ATAGCGATAA AGTGAATTTA GCAGCGGCAG GCAATCGCGC TGCTACCAAC GCGATGGTTG GCGCGATTCA TATGATCGGC TCAGCTGCGA GCTTATTTAC GCCCAACGTA TTGATTGCCG CCTCGCGCTT TTTCCGTTCG CAACCACCAA GTAGCGTGCC AGCAGCGCCC GATAAATAA
|
Protein sequence | MRRDHAIVIG SSIGGLLSAR VLSEVYTKVT ILERDTLPDG VQFRAGVPQG RQLHLLLMRG LLELEKLFPK LREDMKALGG NELDLMNDCR YLFAAGWAEP RPSDYRSIIA TRPLFDAAIL GRIRQIPNIE IQTRQEVTDL LFHGRQVVGV KLRNRDTGEQ HELNANLVVD AAGRPSKLPE WLEQHQFGRV KETIYKSDVG YATRMYTAPA DWQGWPAIFV MPMPPHIRHG ATFLKVENDQ WLLTIGGVGS QRPPTDSAEF EAWLARMRTP LIAEITQKFT PASQVYGYAK TENRWVHYER LPVMPAGLIA IADSVCALNP IYGQGITNTA MSVQVLAKHL QREGIGSAPW ALIFQKALAR RNKSGWLTTI SEDSRLADTP TYRPILPIRV LQWYSDKVNL AAAGNRAATN AMVGAIHMIG SAASLFTPNV LIAASRFFRS QPPSSVPAAP DK
|
| |