Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0525 |
Symbol | |
ID | 5732442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 611470 |
End bp | 612483 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277652 |
Product | hypothetical protein |
Protein accession | YP_001543301 |
Protein GI | 159897054 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.137053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCAG GCATGCCGAG CAGAGCCGGC GTAAGCAACA GGGTGTGTTG CACACTATTC GTCAACTTGT GGCGAAACGT GCAACCGATT CTTCTGATGA TGGAAAGTAG GCGTTGTATG GCACCACAAT CTGATCGTAT TCAGGTGCGA TGGCGCAATG GGTGGACTGC AATCCCCCAT ACCATCTTGC GTGATTCGCG ATTATCACGT AGTGCGCGAT TACTGTGGGG TATTCTTGCA TCCTATGCAG GAAACGAGGA AGCTGCATGG CCCCAACAAA AAGACCTAGC CCAAGCATGT GCTGAAGATG ACAAAGTACC CCATATTCGG ACAATCCAGC GCTGGCTTAA AGAACTTGAA ACTTATGGCT GGTTAGCCTC AATTCAAACC CGCCATGGCA ATATTTACGA ACTGTTTGAA TCTCAGCAAC GCGACACCAG TGTCGAAGAG ACGACACCAG TGTCGTGTCA GACACGACAT CAGGATCGCG TCAGAGGCGA CACCAGTGTC GTGTCAGAGG CGACACCAGT GTCGCATGGT TTAATTAACA AGAATCAATT AATAAGAAAT AATAATAATA CTCCTGCTGC GCAGCCAAAA AAACCAACGG ATACTGAAAC CTATCGACTA CTTCGGGATC GGCAAGTTTG GACGGCAAAA AAACATGCCC ATGAGCCGCT TGAAATAATT CAAGCCTATC TTGACCATGT TGGGCCAAAC TACCCCGCCG CCCAAATCGC CCTTGACCTC AAAGCAGGCG TTCACCATCG CCTCACTCCT GAGCCAGAAC CATCACCACC GCACACCGAA CCAGCCAACG ATGATCGGCG GCCTGCATGG ATTGGACTCG ATCAATGGCA AGGCCTGACC TCAAATCAAC GTGATGCCTT GCAATATGCC CAACTGGTGA ATGGCCGAAT TCAAGCCCAA TACGATGATT GGACTGAAAT GATCTACAAG CGCTGGGGGC CATTGGCCAA CAGGCTGGTT GAAGCTGCTG GGGGGACACA ATGA
|
Protein sequence | MNSGMPSRAG VSNRVCCTLF VNLWRNVQPI LLMMESRRCM APQSDRIQVR WRNGWTAIPH TILRDSRLSR SARLLWGILA SYAGNEEAAW PQQKDLAQAC AEDDKVPHIR TIQRWLKELE TYGWLASIQT RHGNIYELFE SQQRDTSVEE TTPVSCQTRH QDRVRGDTSV VSEATPVSHG LINKNQLIRN NNNTPAAQPK KPTDTETYRL LRDRQVWTAK KHAHEPLEII QAYLDHVGPN YPAAQIALDL KAGVHHRLTP EPEPSPPHTE PANDDRRPAW IGLDQWQGLT SNQRDALQYA QLVNGRIQAQ YDDWTEMIYK RWGPLANRLV EAAGGTQ
|
| |