Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4466 |
Symbol | |
ID | 5736317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5711556 |
End bp | 5712794 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281629 |
Product | hypothetical protein |
Protein accession | YP_001547226 |
Protein GI | 159900979 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000364274 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATGGA GCATGCGGCT ACGTTCGCTC TGCATTCCTG CTCTATTGAC GTTCATGATC GCTGGTTTTG CCGCGTTTAT GTTCAGTCAA CCGCACGCTA GCTTCGCTCA AGATAGCGAA GGTTCACCAC CTGATTGCCC TGGCTTGGGT GATCTCGGTG CTTGGTTCAC CGCTGAAAAC GAAGTTGTTA TCATCAATCG TTCGAGCGTT TGTACCTACG AGGTTGGCAC GGCTGTCTAT AAAAAATTCG ATGCCGTGAC CGATCATCAG GTGCTTTTCG CCTCGCAAAC TGATCAACTG CCCCCCAATA CCCAACGCAC CTATCGCCAA ACCTTAAACC CTTGTGCCGG CCAAATGAGC GCCTTCTTCG GGCCGGTGCT GCCAAGTTTA ATGAACCAAC GCTACAACGA GCGCCTGTTG GCAGCCAATC ATTTTGGTGG CACGAATTAT TGTTTGCGCA GTTGTGACTC GGGTCAATTG CTAGGTTGGG TCGGGGTCAG CTCGAATGCA ATTTTGCAAA GCTCAGCTCG CATCGGCATG AAAATCACAG GTGCACCGCC ATGGCAAGTT GAGTTTCGCC TGACTGGCCC GATGACCATG ACTCACATCG AGCAAATTGA TCCCTATTTG TTCACAGGCA ATACCTGGAA TTTAAGCGAC GTACCAATTG GCGATTACAC CATGAAAGCC ACCTATATTG CCAACCCAGG CGTGCGCTGC GATAGCAAAA CGATTCAATT TAAAGTTGCT AGCACCCCGC TGCCAACTCC CACGCCATCG CCAACCCCCA CCCCAGTGCC AGTCTATCGG GCGCGAATCG TGCTCAATTC ATTGGTTATT TGGGATGCGA ATCGTGGTGA TGGGCTAACC AATTTCGATG GATTTGTCCG GCCAGGCGAT ACAATTTCCT ACACCCTTGC CATTCAAAAT ACTGGCAACA TGCCCCTGAC CAATGTGCAA ATCGTCGATC CCTTCTCGAA TGCGACCAGT TTTGCGGGAT TAAGCCTACC AGCACCCCAA ATTCAGCCAA GCCCCGATGG TTCCGGCCCG ATTGGCTGGC AAATTCCCGT CTTGCAGCCC AACCAGCGCA TCGATGCCAC CTTCCATGTG ATCGTCAAAG AATCGATTCG CGGCACCTAT ACCCAAATTA TCAATAATGC TAGCTTTGCC AGCAACGAAA CTGGCCAACA GTGGAGCAAT ACCACTGAGC ATATTTACAA TCCAGATTTT GATCAATAA
|
Protein sequence | MQWSMRLRSL CIPALLTFMI AGFAAFMFSQ PHASFAQDSE GSPPDCPGLG DLGAWFTAEN EVVIINRSSV CTYEVGTAVY KKFDAVTDHQ VLFASQTDQL PPNTQRTYRQ TLNPCAGQMS AFFGPVLPSL MNQRYNERLL AANHFGGTNY CLRSCDSGQL LGWVGVSSNA ILQSSARIGM KITGAPPWQV EFRLTGPMTM THIEQIDPYL FTGNTWNLSD VPIGDYTMKA TYIANPGVRC DSKTIQFKVA STPLPTPTPS PTPTPVPVYR ARIVLNSLVI WDANRGDGLT NFDGFVRPGD TISYTLAIQN TGNMPLTNVQ IVDPFSNATS FAGLSLPAPQ IQPSPDGSGP IGWQIPVLQP NQRIDATFHV IVKESIRGTY TQIINNASFA SNETGQQWSN TTEHIYNPDF DQ
|
| |