Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5036 |
Symbol | |
ID | 5736995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 49805 |
End bp | 51475 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641282203 |
Product | hypothetical protein |
Protein accession | YP_001547794 |
Protein GI | 159901548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0701972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCTA CCTGGAAATA CGGTCGTATC CTTGGGCATG GAGTCATTGG CAGGCTGTTC GGCATGATCC TGCTAAGAAT TTCTGAACGC GAGGCATCGA TCATTCTTCA TCGAATCCAA GCCGCGCTCT ACTCTAGCAG ACCGTTCTAT AGTATCCACC ATCGCTACAA TGCTGCAACG GATTGGCTGT TCGCATACTT TTTTCCGGAC AACCTTATGG GTGACCTCCT CATTATCTTT CTGAATCGAA GGAGACCTAT GGCTTTCATT ACATCTTCAG ATCTTTGTTT GGTGCTGATT CATCGTTCAC TCCGTGGCCG ACTCTTGGTT TGGCAGGCAG CGATTATGGT GGCTATGGCG CTGCTCCCTG GGAATAGCCT CGCCGCTAGC CCGAATCAAA GCTACGAGGT TGGCCCTGGG CGGACATACG CCCGTCTTTC CGATCTGGTC AGCGCCGACG TGCTCGGCCC TGGCGATACA GTGCTGGTCT ACCCGAATGG GACGGCATCC TACAATGACA CTGTGATCTT CGACACCCAT GGCACGGCTG ATCGCCGGAT CACGATCCGC GGGGTGCGAG TCAACGGACA GCGCCCTATC CTGTCCAGCA ACAACAACTA TGGAATCGTC TTTAAAGGTG ACCATTATAT CTTTGAGGGA TTTGAAGTCA CGGGTGCCGT CGGGAACCAG TATGTGATCA TCCACCGTGC AGACCACATC CTCATCCGTG ATACACTCAT CCGTGACTGC CCCGGTACAG GACTGCTCGG CCACGACGAG GACGCAGGCT CCCTTACGCT CGATCACGTT GAGGTAACCA ACTGCGGCAA CGGCCTCTAC CAGCATCCCA TTTATATGAC AACTGGCTTG CCCGGCGCGG TGTTTCGAAT GCAATACTGC TACCTCCACA ACCAGAAAGG CGGCAACGGT GTCAAGAGCC GTGCCAACCG CAACGAGATC TACTATAACT GGATCGAGGG CAGCTACTAC CACGAGCTGG AGATGATCGG TCCCGACGAT GGCACGGGCG GCTCGCCCGA CTCGCCGCGC CACTCGGATG TGGTCGGCAA TGTCTTTATC AAGAAGCAGG ACTTTGCGAC GCTCGTCCGT ATTGGCGGCG ACGGGACAGG ACAAAGCTGG GGGCGCTATC GTTTCGTCAA CAATACCCTA GTTGGTCGCA GTGATGGAGC CGTCGCCATC CGGGCCTTTG ACGGGCTGCA AAGTATCGAA CTGCACAACA ATGTCTTCAC CAACGCCAAC GGCACGGGAA TGCGTATCAT TAGAGACACC GAGGCCACAT GGCACAATGG CTTGCGGGTA GTTGCTGGGA TCAATAACTG GATTCAAGCA GGCTCGGTCA GCGCCCCAGA ACTCATCGGC ACTATTCAGG GCACCGATCC TCAGTTCGTA AATCTGGCGA CTGGCGATGT TCGCCCGTCT ACGAATAGTC CGCTGATCAA CGTTGGGACA TCCAACCCCG CCAGCCCAAC TGGCTACCCG TTCCCCTCGC CGCTCATGCT GCCGAAGCAA CATCCGCCGC TCCGGACTAT TGCGCCGGTA ACAGTGGTCG ATGCGCGGCC AGTCGTCGGT GCAATCGATG TTGGTGCCTA CGAGATTGGG ATACCGCCAC TCTACACCAA TCGCGTCTAT ATCCCGATCA TAAAACGCTA A
|
Protein sequence | MAATWKYGRI LGHGVIGRLF GMILLRISER EASIILHRIQ AALYSSRPFY SIHHRYNAAT DWLFAYFFPD NLMGDLLIIF LNRRRPMAFI TSSDLCLVLI HRSLRGRLLV WQAAIMVAMA LLPGNSLAAS PNQSYEVGPG RTYARLSDLV SADVLGPGDT VLVYPNGTAS YNDTVIFDTH GTADRRITIR GVRVNGQRPI LSSNNNYGIV FKGDHYIFEG FEVTGAVGNQ YVIIHRADHI LIRDTLIRDC PGTGLLGHDE DAGSLTLDHV EVTNCGNGLY QHPIYMTTGL PGAVFRMQYC YLHNQKGGNG VKSRANRNEI YYNWIEGSYY HELEMIGPDD GTGGSPDSPR HSDVVGNVFI KKQDFATLVR IGGDGTGQSW GRYRFVNNTL VGRSDGAVAI RAFDGLQSIE LHNNVFTNAN GTGMRIIRDT EATWHNGLRV VAGINNWIQA GSVSAPELIG TIQGTDPQFV NLATGDVRPS TNSPLINVGT SNPASPTGYP FPSPLMLPKQ HPPLRTIAPV TVVDARPVVG AIDVGAYEIG IPPLYTNRVY IPIIKR
|
| |