Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1413 |
Symbol | |
ID | 5733321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1628208 |
End bp | 1630442 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278551 |
Product | band 7 protein |
Protein accession | YP_001544185 |
Protein GI | 159897938 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGA TTGCAGTGTT AGTCGGGATT ACCCTACTGG TCGTGATTCT GTTGGTTGTG GTGTTTTTTG CCTTGGTCAA CCATTTTTAT GTGAAAGCAC CAGCCGACCG TACCTATGTG CGAACTGGTG GCAGCAAACC AAAGGTTGTT TTCAATGGTG GTAGCTGGGT GATTCCAGCT TTTCATGAAA TTACCTGGGT TGATTTGCGC ACGATGGATA TTGATGTCGA GCGGACGGAA GCCAATGCGT TGCTGACGAT CGACCCCCAA TATGCCGATA TTCGGGCAAT TTTCTTTATC AAAGTTAATC CAATTGCTGA AGATATTGAA CGCGCTGCCC GTACCATCGG TGGCAAAGAA GTTAATACCG ATAGTGTAAA ACGTTTGGTT GAAAGTAAAT TGGAAGGTGC ACTGCGCGAC GTAGCTGCTA CCTTTACCTT GATGTCATTG CACCAAGAGC GGGAAAAATT TGTTGAGCGA GTGCAAAATT TGGTGCGTAG CGATTTAGCT GAAAATGGTT TGGTGCTTGA AGCAGTTTCA ATTACCACAT TAAAGAGTGC TCGCCAAGGG AGTTTTGGCA CTGATGATGT GTTCGGGGCG CAAGTGGCGC GGGCCAATGC CGAAGTTATT CAGCAAGCGC TCAAACAACG CAACGAAATT GATCAAATGA CCCAAACTGA AATTGCCAAG CGCAATGCTA CTGCTGAGCA AGAACGCAAT ACGATCGAGC GCCAAAAGCA ACTTGAAATT GCTCGACGTA ATGCTTCGAC TTCGCAAGAA CAAAATGATA TTGAACGCTC CAGCGAATTG GAAATTACGC GGCGTAATGC TGATGTTGAT CAAGAAAAAT TGAATTTAGA GCGAAACCTA TCGCAAGCTC GCGCCACCCA ACAACGTGAA ATTTTGATTC GCGAATCTGA GGAACGCACT GCTGCTGAGC GAGTGGCTTA CGAACAGCAA CAAGCTGCTG AATTAAGTCG GGTTGAAAAA GAACGCACGA TTGCCGAAGC TGAAAAGCTT AAAGAACAAG CGGTCATGTT GGCCGAACAA CGCAAGCAAC AAGCGATTCA GTTGGCCGAA CAAGAACGTC AGCGCGAAGT GCAGCGCAGT CAAGTTCTGC GTGAACAAGC TGTCCAAGTG GCCGACCGCG AACGCCAAGT GGCCTTGGCT CAAGAGCAAG CCAAGCTCGA ACAAGCAGAA AAAGAACGTT TGGCAATTGC TGCCGAACGT GAAGTAGCCG AGCAAGGCGT GGTTACGGTG CAAGAACGTG CTGCCGCTGA ACGTGAAGCG CAAATTCAAA TTATCAATGC TGAACGTGAT GCCAAGCGCG AAATTATCAA TCGTAAAAAT GAAGTTGAGC TTGAAACCTT CCGCCAAATT AAGCAGGCCG AAGCCGATGC TGAAGCCTTG AACAAAAAGG CAACCGCCGA AGCAAGTGCC GCAATCAAGA TGGCCGAAGC TCGCCGTACC GAAGCCCAAG CCATGTCCGA TGCGGAAATT CTGCGGGCTG AAGCAACTAA AGCAACCGTT GCTTCCCAAG GTTTGGCCGA AGCTGAAGTG ATCAAGGCCA AAGCTGATGC CGCCCGTGTC GAAGCTGAGG CAATTCGTGA GCGTGGTTTG GCGGAAGCTG AAGCCGCTCG CGCCAAAGCC CTCGCCGAAG CCGAAGGTCA AAAAGCCTTG GCCGAAGCTT TGGCTGCTCA CGCTGGGGTA GCCCAAGAGC TAGAACTTGA ACGGATTCGC ATGCACGCCC AAGTTGAAAT TGGCGTGGCT CAAGCCAAAG CCATGGGCGA AGCGATGGCA GCGATGGACT TCAAGCTTTA TGGTACGCCC GAAACTGCTC AACAAATTCT GCGCATGATA GGTTTGGCCG ACGGCGTTGG CAGTTTGATC AACACCGCAC CTGCTCCATT AAAAGAGCTT GGCAATCGTT TGATCAACCG TGTGTTGCCA GCGAATGGCA ATGGCGATGC TGAAAAATCG GCTAGCAACG ACAATGGCTT GAATCTGACC GCTGCCCAAC CAGTGTTGCG CGAAGCAGCC TTGATTGCCA GCCAATATTT GAGTGCCGAT GAGTTGCAAA CTTTGACGGT TGGTGCAGCC CTCGAACAAG TCTTGGGTGT CGCTAGTGAA GAGCAACAAG CAGTGTTACA TAAAGTCCAG GGTATGCTGC AATTGATGCC CCAATTGGCT GACCAACCAT TGAGCAGTGT CTTGATGTTG GTGCAAAATA GCTAA
|
Protein sequence | MEQIAVLVGI TLLVVILLVV VFFALVNHFY VKAPADRTYV RTGGSKPKVV FNGGSWVIPA FHEITWVDLR TMDIDVERTE ANALLTIDPQ YADIRAIFFI KVNPIAEDIE RAARTIGGKE VNTDSVKRLV ESKLEGALRD VAATFTLMSL HQEREKFVER VQNLVRSDLA ENGLVLEAVS ITTLKSARQG SFGTDDVFGA QVARANAEVI QQALKQRNEI DQMTQTEIAK RNATAEQERN TIERQKQLEI ARRNASTSQE QNDIERSSEL EITRRNADVD QEKLNLERNL SQARATQQRE ILIRESEERT AAERVAYEQQ QAAELSRVEK ERTIAEAEKL KEQAVMLAEQ RKQQAIQLAE QERQREVQRS QVLREQAVQV ADRERQVALA QEQAKLEQAE KERLAIAAER EVAEQGVVTV QERAAAEREA QIQIINAERD AKREIINRKN EVELETFRQI KQAEADAEAL NKKATAEASA AIKMAEARRT EAQAMSDAEI LRAEATKATV ASQGLAEAEV IKAKADAARV EAEAIRERGL AEAEAARAKA LAEAEGQKAL AEALAAHAGV AQELELERIR MHAQVEIGVA QAKAMGEAMA AMDFKLYGTP ETAQQILRMI GLADGVGSLI NTAPAPLKEL GNRLINRVLP ANGNGDAEKS ASNDNGLNLT AAQPVLREAA LIASQYLSAD ELQTLTVGAA LEQVLGVASE EQQAVLHKVQ GMLQLMPQLA DQPLSSVLML VQNS
|
| |