Gene Information |
Locus tag | Haur_1334 |
Symbol | |
ID | 5733226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1544738 |
End bp | 1546813 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641278472 |
Product | hypothetical protein |
Protein accession | YP_001544107 |
Protein GI | 159897860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0854082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTTG ACGATATTAT CACCGATGGC AAACATATTA CGCAACTACG CACAGTTTTA GTAAAATCAA TGGGTGAGAA TGGTGCAGAT AAGCTTATTA ATAATATTCT AGAATTAATT CAAGAATTAC CTTCTGCCGA AGAAGGAAGT CAAAACCGAC ATGGATTGCT TCTAGGCTAT ATTCAAAGTG GTAAAACATT TGCTTTCACT ACAGCTATAG CATTAGCGGC AGATAATGGA TATCGACTCT TTATTATTCT TACTTCTAAT AACCTTATAC TCTATAATCA GACAATTGAT GAGCGGTTGA AACAAGATTT ACAAAGTATA GAAGTGGAAG GGAAGGATAG TTGGGAACAA AAGATACTAA TGATGACCCA AACCCTTAAA GATCCTAAGG GTGTTTTAGT ATTAGTTACA ACAAAAAATA CTGCTATTCT TTACAAGTTA GAACAAACCC TTAGAACAAT TCAAGAAGAG CTCAAGATGG GCCTTCCTAT AGCATTAATT ATTGATGATG AAGCTGATGA GGGTGGATTA GATACTAATA CTCGAAGAAG AAGCGTTAAT CCTCTTATAG AGGCTGGGCC TACGTTCAGT GCTATTGAAG AGATACGTCG TTTAGTTCCT AATCATGTCA GATTACAGGT TACAGCTACT CCTCAAGCAC TCTTTCTTCA AGATTCTGGA CATGAATCAA GACCTGGTTT TACTGTTTTA TTGGAACCAG GGGCTGATTA TGTTGGAAGC GAACAGTTTT TTGCGCTGAA ACAAGAAATT GACATGATTT ATGAAAATGA TGATGAAAAT GAATTAGAGG AACGTAAATC AAAAATTATA CGAAGAATCG ATCAGCATGA TATCCATATG ATGATTGAAC AAGAAGGTGA TAGTATTCCA GATAGTCTGC GAGATGCATT ACTAACATTT TATATTGGAG CAACTATCAA GATAGTTGAT GAACCTAGTA CTAGATTTTC TTTTCTTTGT CATATTAGTG CGAGAAAAGC AGATCATGAT AAAATTAGTC AAATAATAAA TAAATATATA GGAGTACTTA GAAAATCATT AATAGATTAT GTTGATAATA ATATCACAAG TGAAGATATA TATTATCTAG AAAAAATATA TACTGACATA ATAAGTACAT ATGAGGATGG TATTTCATTA GGAACGATAA TTAATGAATT AAGAGAGTCT ATTATAAAAA CAGATATAAG TGTAATTAAC AGTAGTACGA CCTATCAACC AACATATTCA GGAAAATATA ATATTTTCAT TGGAGGAACT AAAATAGCGC GTGGGGTCAC CATAAAAAAT TTAATTGTCA CATATTATGG GAGACAACCA AAAGTAACAA ACATGGATAC CATGCTTCAG CATGCAAGAA TGTATGGGTA CAGAAAAAAT CATATGGATG TTACAAGACT ATTTATAACT GAAGAAATTG AAAAAAGATT TACTGTTATT TATGAATCAG AAAAAGCATT ACGTGATTTA ATAAAAAGAT ATCCTAATGA AAATTATCGC AGTATTATTA TAAATAACAC GGTAAGAGCA ACAAGAAACA ATGTTCTAAA TAAGTTTAGT ATAGGATATT ACGTTTCTGG AAAGAATTAC TTACAAAGAT ATCCATATTA CAATAAGTCA GATATAGATA AAACTACTAA AAATATTGAT GCCATATTGG AAGACTATCC AACTACCGGT ATCAAGACCG AGGAAAAAGA GGTTGATATA GAAATTCTGA TAGATATATT AAATAATATC CATTCGGTAC CTAGAACTTT TAGTCTTTGG 
AATGACAAAA AAATTATATC TGCACTGGAA TTAATGAAGA CAGGAAACAT TACGAGAGGT CTTTTAATTG TTAGCCGTAA TCGAAATATT GGTAGTAAAG ACAAATTTGG TGCTTTATTA CCACCCGGCT ATAAAGCCAA AGCAAGCCGA GAATATCCAA CTTTATTTAT ATTCAAAGTT ACTGGCGAAA ACTGGAATGG AAAACCTTTT TGGATACCTG CAATAACATT TCCAGATACA AAAGACAAAT ATACTTTTGT CTTTAATCTT TCATAA
|
Protein sequence | MELDDIITDG KHITQLRTVL VKSMGENGAD KLINNILELI QELPSAEEGS QNRHGLLLGY IQSGKTFAFT TAIALAADNG YRLFIILTSN NLILYNQTID ERLKQDLQSI EVEGKDSWEQ KILMMTQTLK DPKGVLVLVT TKNTAILYKL EQTLRTIQEE LKMGLPIALI IDDEADEGGL DTNTRRRSVN PLIEAGPTFS AIEEIRRLVP NHVRLQVTAT PQALFLQDSG HESRPGFTVL LEPGADYVGS EQFFALKQEI DMIYENDDEN ELEERKSKII RRIDQHDIHM MIEQEGDSIP DSLRDALLTF YIGATIKIVD EPSTRFSFLC HISARKADHD KISQIINKYI GVLRKSLIDY VDNNITSEDI YYLEKIYTDI ISTYEDGISL GTIINELRES IIKTDISVIN SSTTYQPTYS GKYNIFIGGT KIARGVTIKN LIVTYYGRQP KVTNMDTMLQ HARMYGYRKN HMDVTRLFIT EEIEKRFTVI YESEKALRDL IKRYPNENYR SIIINNTVRA TRNNVLNKFS IGYYVSGKNY LQRYPYYNKS DIDKTTKNID AILEDYPTTG IKTEEKEVDI EILIDILNNI HSVPRTFSLW NDKKIISALE LMKTGNITRG LLIVSRNRNI GSKDKFGALL PPGYKAKASR EYPTLFIFKV TGENWNGKPF WIPAITFPDT KDKYTFVFNL S
|
| |
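The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(1544738, 1546813)
    print(length)                  # 2076, matching the Gene Length field
    print(protein_length(length))  # 691, matching the Protein Length field
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field. The blocks of ten bases can be pasted as-is because whitespace is ignored.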