Gene Information |
Locus tag | Haur_0109 |
Symbol | |
ID | 5732002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 141655 |
End bp | 143475 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641277231 |
Product | hypothetical protein |
Protein accession | YP_001542889 |
Protein GI | 159896642 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
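The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS span includes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(141655, 143475)
print(length)                     # 1821, matching "Gene Length"
print(protein_length_aa(length))  # 606, matching "Protein Length"
```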
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTT ATGCAAAGGC TTGTATGGGC ACATTTAAAT TAACATCAAT CAAAGATCGG TATATTATTG TATTTGGAAC ATTTTGTATA TTTCTATGCT ATGCACTATT TATCTTCAAT TCTGATGTGG CTTATTTAGC CTTTACTCAA CCTCAAGATG ATTCGTTTTA TTATTTCCTA CCGGCCTGGA ATTTCAAAGC CTATGGATTT TATACCTTTG ACGGTTTGAA TGAAACATAC GGCTTTCAGC CATTATGGAT GGTTTTACTA ACCATAATGG CTTTTTTTAT ATCAAGTAAA ATTCTATTTT TCAAGCTTTC GCTTCTAGTG GGATGCCTGT TCTATCTCTT AACTGGCTTA ATTATCTACA AAATCAGCAA GTTATTGATC AGGCAACGGC TACTAACATT TATTCCATGG TTTGTTTGGG TTAGCAATAT CTACTTATTA CGTATATTTG CTTCAGGTAA AGAAAATTCA CTCTCAGTTT TTTTATATGC ACTGATTGCT TTAAGCCTTC TCAATATCCA TTTTAAAAAT CGCCAACAAT CAAGCCTGTA TGGGTTAATC GATGCTAGCT TTATGCTGGT TCGGATCAAC AATCTGATGT TTGTTGGGCC AATTCTGCTC TATCGGCTCT ATCATAATCG TCAGCAAAAA TCCCAGATTG GATTCTATTT AGCAACCTTT AGTTTAGTAT TAAGTCTATG GTTTGGCTAT AGCTATGCTG CATTTGGTAC GCTTTTTCCC AATAGTGGCA GCTTAAAGAC AGTTATTTCT AAACCTTCAA TCGTCTATTG GCTTAATCAA CAAACTGGCG TTGAGCTGGG TGGCATGCTT AGCTCACAGG AACAATTACT CTTACAGCAT CCTGAATTTT TAGATGTTCC GCGGGCGAAT TTCTTCTGGC AATATCTTAG TCAAATTGTG CCCCAAAAAG TTACAGATAT TTATTTTGAT CAAAAATTCT CGTCAATAAC GCTTGTTTCA ACGCTCAATC AAGTTCTTTT GCTGAGTATT GGGCTTGGAT TAATTGGATT TTTAGGTGGA TTATGGCAAC GAAGCATTAC CATCAATCAA CAACTAATCA AATTATTAAG CTATTGTGGC ATCATTGCCT TAGCAAATAG CGTGGTTAAT TGGCTATTTT TTCAACGTTA CCTCAGCTTT ACGATTTGGT ATCCAGTACC AGAATTATTT TGGTTTAGCT TGGTATTGGG TTTATTAGTG GTAGCAAGTA TCATTGGTTG GCAACAATTA AGCCAAATCA AACCCATCAT TAAGCCAGTG CAATACATAA TGTTGCTAGC AATTGGCATA TTTTTATTAA GCCCATTGAG TCGTTTTCCA CAGGAGCTAT TACCGCAAAA AACCAGTGAA CATTATCGTG GAACCTATCG ATTTTTCTCG TATATTTGGC AGGATGAAGC ACTCAAAGCC ACTAGTTGGG CCAACCAAGC ATTAGCGCCA AATACGACTA TTGGTTCGTG GAATGCAGGG ATTGTTGGCT ATTTTTATGA AAATGGTTCG ACGATTAATT TAGATGGCTT AGCAAACAGC CCAGCGTTTG TCGATGAGGT TTTACGCCAG AATATTTTAT TTACACGTGG CTTAGCAAAC GAAAATGTGC TATGGAACTA TATTCAACAC CAGGATATTC GCTATATTAT TGATTCATGG TATAGCGGGG AAATGGGCAA AAGCAGATTT ATTAATAGTA TTCCACCAGA GCATTATGAA ATTATCTACG AGGGTGCGGT TACGTTTTCA GATGGGAATC GGCCTGATCG AAGAATGTAT 
GTATTGAAAT TGAAGTATTA A
|
Protein sequence | MDFYAKACMG TFKLTSIKDR YIIVFGTFCI FLCYALFIFN SDVAYLAFTQ PQDDSFYYFL PAWNFKAYGF YTFDGLNETY GFQPLWMVLL TIMAFFISSK ILFFKLSLLV GCLFYLLTGL IIYKISKLLI RQRLLTFIPW FVWVSNIYLL RIFASGKENS LSVFLYALIA LSLLNIHFKN RQQSSLYGLI DASFMLVRIN NLMFVGPILL YRLYHNRQQK SQIGFYLATF SLVLSLWFGY SYAAFGTLFP NSGSLKTVIS KPSIVYWLNQ QTGVELGGML SSQEQLLLQH PEFLDVPRAN FFWQYLSQIV PQKVTDIYFD QKFSSITLVS TLNQVLLLSI GLGLIGFLGG LWQRSITINQ QLIKLLSYCG IIALANSVVN WLFFQRYLSF TIWYPVPELF WFSLVLGLLV VASIIGWQQL SQIKPIIKPV QYIMLLAIGI FLLSPLSRFP QELLPQKTSE HYRGTYRFFS YIWQDEALKA TSWANQALAP NTTIGSWNAG IVGYFYENGS TINLDGLANS PAFVDEVLRQ NILFTRGLAN ENVLWNYIQH QDIRYIIDSW YSGEMGKSRF INSIPPEHYE IIYEGAVTFS DGNRPDRRMY VLKLKY
|
| |
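The sequences above are displayed in space-separated blocks of ten and may wrap across lines, so they need to be cleaned before any analysis. A minimal sketch follows, assuming the text has been copied into a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool. The example runs on the first two blocks of the gene sequence only.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted nucleotide sequence."""
    seq = "".join(raw.split()).upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGATTTTT ATGCAAAGGC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.35
```

Applied to the full gene sequence, the same two calls give a length and a GC fraction that can be compared with the Gene Length and GC content fields of the record.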