Gene Information |
Locus tag | Haur_3050 |
Symbol | |
ID | 5734922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3852831 |
End bp | 3853991 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280194 |
Product | basic membrane lipoprotein |
Protein accession | YP_001545816 |
Protein GI | 159899569 |
COG category | [R] General function prediction only |
COG ID | [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein |
TIGRFAM ID | |
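The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3852831, 3853991)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.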
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0628382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAC GCATGATTGC GTTCATGATG CTGTTGGTCT TGATGATTCC GGTATTGGCT GCTTGTGGCA CCGAAACCGC TGCAACCACA GCACCAGCGG CTACCGCCAC CACCGCAGCC GCTGCCCCAA CGGCAACTGC TGCTGCCGAA GTAACCGCCA GCCCAGCCAC CGAAGCAACC GCTGTGGCAA CCGCTGCGGC AACCGCCGAA GCAACCACTG GTGCTAGCAC CCCAAGCGAC ATCAAAATCG GTTTGGTTAC CGACGTTGGT AAAGTCGATG ACGGTACCTT CAACCAATTC GCGTTCGAAG GCTTGAAACG AGCCGAAACC GAACTTGGCG TGAAGATCGA CTACATCGAA ACCATCGATC CAAAAGACTA CGAAAAGAAC ATTGAACAAT TCGCTAGCCA AGGCTACGAT TTGATCATCG GCGTAGGCTT CTTGATGGGC GATGCAATCA AGGCCGCTGC TTCGAAGTAC CCTGACTTGA AGTTTGCTAT TGTTGACTTC GCTTACGAAC CAGCATTGCC AAACGTCAAG GGCTTGGTCT TCTCAGAAGA CGAATCAGCA TTCATCGCTG GCGCTTTGGC CGCAATGGTT TCAAAGAGCG GCAAAATCGG TGCTGTTGGC GGTAAGGAAC AAGTTCCTGC GGTCAAGAAG TTTGTACTTG GCTACGAAGC TGGCGCTAAA TACGTCAATG CCGATATCCA AGTCAGCAAA GTTTACATCG ACTCATTCAC CGACCCAACC GCTGGCGGCG AAGCTGCTAA AACCCAAATC GCTGAAGGCG CAGACGTAAT CTTCGGTGCT GGTGGCCAAA CTGGCTCAGG CGCTATCAAG ACCGCTGCCG ACAACAATGT ATTCGTGATT GGCGTTGACC AAGACGAATA CAACACCACC TTCAAAAAAG GCCCAGCCCC AATGTTGATC ACCAGCGCTA TGAAGCGCGT TGACAACGCT GTGTTCGATG TGGTCAAGGA AGTTGTCGAT GGCACCTTCG CTGGCGGCTT GTACTTGGGC AACGCTGCAA ACGGTGGGAT CGACTACGCT CCATTCCACG ATGCAGAATC AGCATTGCCA GCCGACGCTA AAGCCAAACT TGATGAAATC AAGAAGGGCT TGGCTGATGG TACGATCACC ACTGGCGTTA CCTTGAACTA A
|
Protein sequence | MNKRMIAFMM LLVLMIPVLA ACGTETAATT APAATATTAA AAPTATAAAE VTASPATEAT AVATAAATAE ATTGASTPSD IKIGLVTDVG KVDDGTFNQF AFEGLKRAET ELGVKIDYIE TIDPKDYEKN IEQFASQGYD LIIGVGFLMG DAIKAAASKY PDLKFAIVDF AYEPALPNVK GLVFSEDESA FIAGALAAMV SKSGKIGAVG GKEQVPAVKK FVLGYEAGAK YVNADIQVSK VYIDSFTDPT AGGEAAKTQI AEGADVIFGA GGQTGSGAIK TAADNNVFVI GVDQDEYNTT FKKGPAPMLI TSAMKRVDNA VFDVVKEVVD GTFAGGLYLG NAANGGIDYA PFHDAESALP ADAKAKLDEI KKGLADGTIT TGVTLN
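The sequences are printed in space-separated blocks of 10 characters, so the spaces need to be removed before any computation. As a rough sketch of how the reported GC content could be checked, the helper names below are hypothetical and the input shown is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group bases into blocks of 10."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustrative input only: the first two 10-base blocks of the gene sequence.
# Paste the full "Gene sequence" field here to check the whole gene.
fragment = "ATGAACAAAC GCATGATTGC"
print(len(clean_sequence(fragment)), round(gc_content(fragment) * 100))
```

Running `gc_content` on the complete Gene sequence field gives a fraction that can be compared with the GC content field of the record.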