Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2131 |
Symbol | |
ID | 5734019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2673973 |
End bp | 2676378 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641279272 |
Product | hypothetical protein |
Protein accession | YP_001544899 |
Protein GI | 159898652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACTA TTGAACAGCA GCTTGAATTG TTGCCGCGCT TGTGGCGCTA TAGCCTGCTG CGCACCAGCC TCACGGCCCA TGCCGATCGT TGGCCTGATG AACTCTATGT TATGTTGGCA ATCATTGGAC GTTTACCAGA GGCCTTGGCA CAGATTAATG TTTGCGCTTA TCCAGAACGT CAAGTGCTTG CTTGGGCGCG AGTGATTAAA TACGCCGAGC CGCAGGTTCA ATTACAGCTT TTGCAGCGCA TCGAGCATTC GATTCGTTCT ATCCGTGATC CTGAAGATCA GACCTTTGCT TTGTCGTGTC TTGGCCTGGC CTATGCTGAG GCTGGTATTC CTAATGCGAC CTATCCAATC TATCAAGTTA TTGATCGGCC TAATCTAACG GTTCAAGGGT TGTTGCTGCA GGCAAATAGG TTGGCTAGCC AAGGTTTGTC TGACCAAGCG TATTTACTTT TTGATGAACT ATTCATGACG ATTTTCGCCA TGCCTCAGTA CGAGCAATTG TACCAATTAA TGTTGCTTGT TCAGAGCGCC AAGCGTGCTG GGTATAATTC GCTTTGTGAG CGGATTATTC AGAGCCTCTA TGTTCCGCAC GAAGCCCCTA CGTTCAACTC GGCAATTCAG GAGCTTGCTA AAGCCTATGC GGCATATGGT GATTTTGCCG TTGCCCATCA GGCGATTCAA TTGATTAAAC AGCCGCGCAG TTTTATCCAT GCGGCCCGGC AAGTGGCGGT GATTGCCTGT GAAAAACAAA TCCATACCCA TACAGCAAGT TTGCTACAAG CAGCCCATGA ACGTGTTAAG CAGCTTGAGG ATATTGATGA ACACATTTAT CTGTTGGGAC AACTAGCCAT CCCTGCACGG CAAGCTGGCT TAATCGAGCT GGCCCAAACC TTGATGGATG AAGCTTTTCA TAAACTAATT GGTGTGCAGC ACCAATATCC GCCTGTAGCT ACTAGACTGC TTATTCAGAG TTATCAATCT CAACATGCCT TGGCTGATGC CTTAGCGATC ATACCATTCC TTGATAATCC CCAAGCCCAT GATTACGTGC TTGGTAACAT TATTGAGTGC TATTTGAATG ATAATGATCT TACGAATGCT CATCTGCTGT TGAAATTATT TAAACCCCAT GAACAGACAT ATGTAGCTGC CGCTAGTAAC CTATTGATTA AGGCTGGTGC GCAGGGATTA ATCGAACTCG TTTGGCGGCT TTATTGGGAT GTCATGGCTA TATCTAAAGC GATTAATGAT CATGTTAATC ATCGTAATTA TTTTGTCTAC GTTGCCTGTA ATCTTGCGAC TGAGGCGAGC ACGCATGGGT TAACCGTGCT AACGCCGCGA TTGTATAGCG AGGCGATTCA GGCATGTACC ACAGTTGATC ATGGATATAC TCGGCTGCGG TATCTCAAGG ATTTAGTGCT TGCTCAGATG AAACATGGTT TGGTTGCGAG TTTCCCCAAT TTGTTAGCTA GTTTACGCCT AGGAGCTACC CAATTAGAGA TTAATACTGC ATTGAATGAA TTTCTTTGCC CAATCGCGGT GGTTTATGCT GAACAGGGGA ATTATTCGGC ATTTGATGAT TGGTTCAATT ATGCCCATAC CCAATTGAAA AATAGCACCC AAACTGATCA TAAAGCGCTT GTGTCGGGCT ATCGAACGCT GATTAAGACC TATCGTACAT CTGCTTCTGA TTGGATGAAT TCAGCATTTT TAGCAGCGGT GTTGCCAAGG TTGCAGGTGA TCGCCAACAC AACACATTTG GCTAGTGCTA AAAATCTATT GATAAACATT TATGCGGATT ATGCGAGTGA AGGGCATCCA GCCTTTCTCG CTCAAGCATA TGAGATGGCG ATTACAATTG AACCGATAGC TGATCGGCTC AATGCGCTTA AATCACTTGC CAAGGTCTAT GCGAAAGTAA ATGATGGGCC GCATTTACGG GCAATTATTG CTGAGATGAT TGAGCTTGAG CTTGATGATT TAGAGTTTGA GTCGATTGCC TTGGTCTGCG CTAAACAGGG AGATTTTGCC TATGCCCAAG AATTGCTGGC ACGCCAAGAG CCAGCCCCAT GGAAAGATGA AGTTTTATGG TATTTGATTG CCAAGCTGAT TCAAACCAAT CAAGTGGCTA CTGCATGTCA GTTAATTCCA AGCCTAAGTG AAGGCTACAA ACAAGAACGC GAGTTTCAAA AAATCATTAC CTACTATCTT GAACGTCAAC AATTGGCTGA AATTGGGCAA ATTGTTCAAG ATGTTTGGCG TAACTGTATG AGCGATACTG AATTATGGCA ATTAAGTACA ATCATTGTGC CGTTGATTCC CCACTACCCA TGGCTTGGCA TTGCCGTGCT TGATAGCGTG CCATGGGTTG AACAGCAGTT AGCTCGCTTG AAGTAA
|
Protein sequence | MDTIEQQLEL LPRLWRYSLL RTSLTAHADR WPDELYVMLA IIGRLPEALA QINVCAYPER QVLAWARVIK YAEPQVQLQL LQRIEHSIRS IRDPEDQTFA LSCLGLAYAE AGIPNATYPI YQVIDRPNLT VQGLLLQANR LASQGLSDQA YLLFDELFMT IFAMPQYEQL YQLMLLVQSA KRAGYNSLCE RIIQSLYVPH EAPTFNSAIQ ELAKAYAAYG DFAVAHQAIQ LIKQPRSFIH AARQVAVIAC EKQIHTHTAS LLQAAHERVK QLEDIDEHIY LLGQLAIPAR QAGLIELAQT LMDEAFHKLI GVQHQYPPVA TRLLIQSYQS QHALADALAI IPFLDNPQAH DYVLGNIIEC YLNDNDLTNA HLLLKLFKPH EQTYVAAASN LLIKAGAQGL IELVWRLYWD VMAISKAIND HVNHRNYFVY VACNLATEAS THGLTVLTPR LYSEAIQACT TVDHGYTRLR YLKDLVLAQM KHGLVASFPN LLASLRLGAT QLEINTALNE FLCPIAVVYA EQGNYSAFDD WFNYAHTQLK NSTQTDHKAL VSGYRTLIKT YRTSASDWMN SAFLAAVLPR LQVIANTTHL ASAKNLLINI YADYASEGHP AFLAQAYEMA ITIEPIADRL NALKSLAKVY AKVNDGPHLR AIIAEMIELE LDDLEFESIA LVCAKQGDFA YAQELLARQE PAPWKDEVLW YLIAKLIQTN QVATACQLIP SLSEGYKQER EFQKIITYYL ERQQLAEIGQ IVQDVWRNCM SDTELWQLST IIVPLIPHYP WLGIAVLDSV PWVEQQLARL K
|
| |