Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1096 |
Symbol | |
ID | 5732987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1255070 |
End bp | 1256815 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278234 |
Product | hypothetical protein |
Protein accession | YP_001543872 |
Protein GI | 159897625 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0122577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTGC GCTCATTGCG GTCGTTGTTG ACTGTTGATC GTGCTCGGGT CGGCTCATTA TTAAGTGTTT TATTGGTAGG CTTGTTGGCT TGGATGCTGC CGCAGACCAA TGCTCAGCCT GCTTTCGCTG CATCAAGCAC CGTCGTGATT AGCCAAGTTT ATGGGGGTGG CGGCTCTGCT ACGGCTACTT ACAAGAGTGA TTACGTAGAA TTGTTCAATT TGAGTGGTTC TGCTGTCTCT TTAAATGGCT TGTCGATTCA ATATGCTTCA AGCACAGGGA ACTTTAATGG TGTTTTCGCT TTACCAAATG CTACAATTCT ACCTGGCAAA TACTATCTCG TACAGCTATC TCTGGGTACA GGTCTAGGCG ATATCCCAAC TCCAGATGCA GCTTCTGGAA CTAATATCGC TATGTCTGCA ACTGCAGGTA AGGTTATTAT TGCGAATACC ACTACAGCAT TAGGGTGTTC CACAAGCGCG ACTTGTACTC CTGCTCAACA AGCCCAAATT ATTGATCTCG TTGGTTATGG CACAGCTGCT AATTACTTCG AAGGTAGTGG GCCAACAGGT GCGCCAAGCA ATACGACGAG CGTTATTCGT ACCAATCCTT GTGTTGATGC CGATAATAAT GCAACTGAAT TTAGCGTGGG TACGCCAAAC CCACGTAATA CTGCCAGCCC TACGTTGAGC TGTTCAGCGG CCACCAATAC ACCAACGAAC ACGCCGACTA ATACCGCGAC CAACACACCA ACCAGCACTC CAATTGTGCT TGGGGGCGAT AATAATATCC TGTGGGATCA GCTCTATCAC AGCGCCACTG CTGTAAATCC TCAACTTGAG CTTGTGCCAA ACGAGAGCTA CAGCTTTTTG CATAGTGCTA GTGGCACAAT CGACGAAACC ACGGCTGTGA CGATTTCGGC ATTAACTGAT GCGCTTGATG TGCAAACGGT TAGCCTGCGC TACTGGGATG GAGCGAATTC GACTACAATT CCAATGACGA GGATTAAATC GTTGAGCGCT AGCTTTCGCA GCCAGCCAAT CCATAGCTAC GATTTGTGGC AGGCTAGCAT TCCAGCTCAG CCAATCGGCA CAAGCATTTT CTATCGGGTG ATTGCTCAAG ATGGTTCGGC CTCAGCCTAT TTGAAGCACA ATAATGGCCA ATATGTGAAT CCGCTTGGCC AACATGTGCG GGGCTTCAAT GATGATCCCG ATGATTATAG CTACACGGTT TTAGCGGCAA ACCCAACTGC TACCCCAACG AATACCCCAA CTAACACGCC AACGGATACC GCTACGCCGA CGGCGACCAA TACGCCAACC AATACACCAA CCGATACGGC AACGCCAACG GCGAGCAACA CGCCAACCAA TACGCCGACC GATACGGCAA CACCAACCAA CACGCCAGTG GCTCCAACGG CAACCGATAC CGCTACGCCA ACGGCGACGA ACACGCCAAC CAATACGCCG ACCGATACGG CAACGCCAAC CAACACGCCA GTGGCTCCAA CGGCAACCGA TACCGCTACG CCAACGGCGA GCAACACACC AACCAATACG GCTACGCCAA CGATCACGGT GACGAGAACA CCGACACATA CGCCAACTAA TACAGCAACG CCAACGCGCA CGGCGACCAA CACGCCAACC AATACGGCGA CATCGACGGC GACGAATACG CCAACCGTCA CCAATACGCC AATTGCTCAG CAGCATAAAG TGTTCTTACC ATGGGCCAGC AAATAG
|
Protein sequence | MQLRSLRSLL TVDRARVGSL LSVLLVGLLA WMLPQTNAQP AFAASSTVVI SQVYGGGGSA TATYKSDYVE LFNLSGSAVS LNGLSIQYAS STGNFNGVFA LPNATILPGK YYLVQLSLGT GLGDIPTPDA ASGTNIAMSA TAGKVIIANT TTALGCSTSA TCTPAQQAQI IDLVGYGTAA NYFEGSGPTG APSNTTSVIR TNPCVDADNN ATEFSVGTPN PRNTASPTLS CSAATNTPTN TPTNTATNTP TSTPIVLGGD NNILWDQLYH SATAVNPQLE LVPNESYSFL HSASGTIDET TAVTISALTD ALDVQTVSLR YWDGANSTTI PMTRIKSLSA SFRSQPIHSY DLWQASIPAQ PIGTSIFYRV IAQDGSASAY LKHNNGQYVN PLGQHVRGFN DDPDDYSYTV LAANPTATPT NTPTNTPTDT ATPTATNTPT NTPTDTATPT ASNTPTNTPT DTATPTNTPV APTATDTATP TATNTPTNTP TDTATPTNTP VAPTATDTAT PTASNTPTNT ATPTITVTRT PTHTPTNTAT PTRTATNTPT NTATSTATNT PTVTNTPIAQ QHKVFLPWAS K
|
| |