Gene Information |
Locus tag | Haur_1920 |
Symbol | |
ID | 5733809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2314499 |
End bp | 2316658 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641279064 |
Product | hypothetical protein |
Protein accession | YP_001544691 |
Protein GI | 159898444 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC ACATTCGTAA GATAGAAAAT GATAACCGTA TTGCAATTGC GCCTTACAAT TTTGTGCCCC TACCGGATCG GGTTATCACG CTTGATCCTG ATAAAGATAA GATCGATCAC GCTAGGTATG CTAATAATAG AAATACTGGG CAAATTGACT GTAAACTGAC AACGGAAACG GATCTCTATA TTCGCAGCGG ATTAACTGCC AGCGAATTTG CAGCGTCGCA ACAAAATGCC CAACCCGTAG ATCCTCCCCA ACGTGCTAGT TTAGAAGAAT TAATCGCAGT ATTACAAACC AACCCGAAAA ATAAACCCCA ATTTTTCTCC ACTACCAATA GCAATCAACC TGTAATTCCG GCTAGCTCGT TGCGCGGCAT GATTCGTTCC TTGGTCGAGA TTGCTAGTTT TAGCAAAGTT GGCTGGGTGA GTGATAATCC TCAATTCTTT TTTCGAGCAG TAGCCGACAA TAAGAGATCT ATAGGGATCT ATTATAAAAA TAAAATTAAA AATGTTCAGG CAGGGTATGT TTGTAAAATT GGAGATAAAT GGGTAATATT CAAAGCGAAA GTTATTAATG GGAAAAATTT TTGTTGGTAT AAACCTTGGA ATTCCTTGAA CGACGATATT ATCAGAGAAA TTGGCATATC TATGTTTAAT GATAATGATT TTTCTATAGG TTATAAGCCA ATATATTTTG ATAATCCGAA TTTTGATGAA CACAGAAAAT TTATAACAGA TAGAATATCA GTAAGAAATA ATTTGAATAA TAAAGGATTT ATAACTAATA GTGGTAATAT GTTAGAAACA GGTGAGGAAG GAAATAGTAA TAGGAAGAAT TATTGTATTA TTTTTGAGAA ATCTCAAGAG CATAAATACA CAATCAATGA TCAAGCAATC TATGACTACT GTAATAATCT GACTGAATTT CAAAAATTAT TAGGCGAAAA AGGATGTCTG ATAAATGGTC GTCCTATTTT CTTCATTGAG CCTACTCAAG AAAAAGGGGA AGTTATTTAT TTTGGGCATA GTCCTAATTT TCGTTTGGCT TATCGTAGGG ATGATCTCGA TAAAGCAGTA TCAGCCTTGG ATTGTCTCCC TTCTGAAATC AGTAGTAACT TAAATATACC TGATCTGGCC GAATCAATCT TTGGTTATGT GCGTCAAACC AAAACAAATG ATCAGAATCG GGCTGCCGTT AACAATAGCT TGTCTCAAGA GGGAGAAAAA AAGCAAGCCT TAGCCGGGCG GGTATTTTTT AGTAATGCCC ATCTCTCGCC TGAACAAGGT GATGTATTAA AAGAAACACA CATTCCGCAG ATTTTGGCTA GCCCGAAACC AACAACCTTC CAGCATTATT TGGTTCAAAT AGATCCTGCG CAAGATCAAT TAGCGCATTA TTCAACCAAA GGTGCTGTAA TTCGTGGGCG TAAACTGTAT TGGCATCAAC AACCTTCAAA TGCTTATACG ACCGATATTA AACAGGTTGA AAAAGCTCAA ACCCAATACA CTTGCATTAC TCCTGTAAAC CCAGGCACAA CTTTTGAATT CACCATTCGC TTTGAAAATT TGAGCGATGT TGAATTAGGG GCGTTGTTAT GGGTGCTGGA TCTCGCTTCA TGTGCAGGAA AATATCGCTT TAAACTGGGT ATGGGCAAGC CTTTAGGCCT TGGTTCAGTG GCGATTAATT ATGAGTTGCA ATTGACTGAC CGCAAACAGC GCTATCAGCG CTTGTTTGCT CGTGATGGCA ATTGGGAGAC GGGTTTTGAA CCATCAAGCC AAGCGACCCA AACCAAGGCG 
ATTGATGCCT TTTGTAAGTT TATTGATGAT AACCTTGGTA TGGATATTGA GGAGACCGAA CAAATTCAAC AACTTAGATC ATTGTTAGCC TACCGTAACG GCGAAGGAAT TTTATTTGCA GGGTTTTCGA GTGGTGAAGA CTCAGCCTTA CGTTATATGG AGATTGAACG GAATACCAAG ATGTCGTTTT TGCCAAGTCA AGAGGATAAA AAAAATAAGG AAGCAATGAT CAATGAATAT GATGATCGTA GAGTTTTGCC AAAACCTTCG CAAGTTGCAC CGCCACCTCC CCGAGAATTA CCACGACCAC CAATCGCAGA TGATCCAATA AAGGTTACGG TGCGCCGTGT CACACGATGA
|
Protein sequence | MKKHIRKIEN DNRIAIAPYN FVPLPDRVIT LDPDKDKIDH ARYANNRNTG QIDCKLTTET DLYIRSGLTA SEFAASQQNA QPVDPPQRAS LEELIAVLQT NPKNKPQFFS TTNSNQPVIP ASSLRGMIRS LVEIASFSKV GWVSDNPQFF FRAVADNKRS IGIYYKNKIK NVQAGYVCKI GDKWVIFKAK VINGKNFCWY KPWNSLNDDI IREIGISMFN DNDFSIGYKP IYFDNPNFDE HRKFITDRIS VRNNLNNKGF ITNSGNMLET GEEGNSNRKN YCIIFEKSQE HKYTINDQAI YDYCNNLTEF QKLLGEKGCL INGRPIFFIE PTQEKGEVIY FGHSPNFRLA YRRDDLDKAV SALDCLPSEI SSNLNIPDLA ESIFGYVRQT KTNDQNRAAV NNSLSQEGEK KQALAGRVFF SNAHLSPEQG DVLKETHIPQ ILASPKPTTF QHYLVQIDPA QDQLAHYSTK GAVIRGRKLY WHQQPSNAYT TDIKQVEKAQ TQYTCITPVN PGTTFEFTIR FENLSDVELG ALLWVLDLAS CAGKYRFKLG MGKPLGLGSV AINYELQLTD RKQRYQRLFA RDGNWETGFE PSSQATQTKA IDAFCKFIDD NLGMDIEETE QIQQLRSLLA YRNGEGILFA GFSSGEDSAL RYMEIERNTK MSFLPSQEDK KNKEAMINEY DDRRVLPKPS QVAPPPPREL PRPPIADDPI KVTVRRVTR
|
| |
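The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; both assumptions agree with the values listed above. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# Values taken from the record: Start bp 2314499, End bp 2316658.
length = gene_length(2314499, 2316658)
print(length)                  # 2160, matching "Gene Length"
print(protein_length(length))  # 719, matching "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` gives a value that can be compared against the listed GC content; the database may round that figure or compute it differently.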