Gene Information |
Locus tag | Haur_5148 |
Symbol | |
ID | 5737106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 212739 |
End bp | 213917 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282313 |
Product | hypothetical protein |
Protein accession | YP_001547904 |
Protein GI | 159901658 |
COG category | |
COG ID | |
TIGRFAM ID | |
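The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and do not belong to any database API.

```python
# Sketch: consistency check for the coordinate fields of this record.
# Assumptions: coordinates are 1-based and inclusive; the CDS is unspliced
# and ends with a stop codon. Function names are illustrative only.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: one codon per residue, minus the stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(212739, 213917)
print(length_bp, protein_length(length_bp))  # 1179 392
```

Because the gene lies on the minus strand, the listed Start bp is the lower coordinate on the replicon rather than the position of the start codon; the length calculation is unaffected by strand.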
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0430972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGA TCAGCGAATC CGATCCCCAG TTTGTGCACT TTCAACACGG GTGTCCAACA CTCAGTCGTG AGGAACAGGT GGTCTTGTTT CACGCCCTGC GCAACCAAGG GATGACCTTT GCATCCAATG GCACAGTCGC CATTACCAAT AGTGTCATTC ATGGACCAGT TGCGGGCATT AATTTCGGTA CCATGCAGTC AATTATCCAG ATGTTGCCCG AACCGATTGA CCCCCTACCC GCTGCGCTTG CGGCCCTTGC CGCGATTCCA CTCGACCATG TGCCGCAACC CCGTTCGGAT TTACCCCACG CCTCGCGCTT GCCCTTTGAG GCCAGCCCGC ACTTTGTCGG TCGTGAGGAT GAATTGAAAC AATTGGCGGC GGCGATTGGC ACGGCCCAGC CAGCGGTCGT CATGCCAGCG GTGGCCACCG GATTAGGCGG GATTGGCAAA ACGAGCCTGG TGACGGAATT TGCCTATCGC TATGGGGTCT ATTTTCATGG CGGGGTGTTT TGGCTGAACT GTGCTGATCC CGATCAGGTG GCCAGCCAAA TCGCTGGTTG TGCACTTGCT CTCGGCATCG ACCTGACTGG CATGGCGCTC GATGAGCAGG TGCAACGGGT TTTGAACGCG TGGAAATCGC CCATGCCGCG CTTACTCATT TTCGATAACT GCGAGGATCG GGCGATTCTT GACCAATGGA AGCCCACGGT TGGTGGCTGT CGGGTACTCA TCACGGCGCG GTCGGATCAG TGGCCAACGC TCACGCAAAT TCGGCTTGGG CTTCTCTCGC CAACAGAAAG TCGCTCATTG TTACAGCGAC TCTGTGCGCG GTTGACCGAT GCTGAGGCTG ATGCGATTGC CGAGGATCTA GGGCATTTGC CGCTGGCGTT GCATTTGGCG GGCAGTTATC TCAATACCTA TTCCCATCAC ACGGTCGAGC AATACCGCAA AGATTTAACC ATTGCCCACC GCTCGCTGAA GGGTCGTGGC GCGTTGCCCT CACCCACGCG CCATGAACTG GATGTCGAAG CGACCTTCAT GTTCAGTTTT AAGCAGCTTG ATGCCAACGA TGCACTTGAT GCGTTAGCCT TGGGCATGCT TGATGGTGCG GCCTGGTGTG CGCCTGGCAT TCCAATTCCA CGCGAGTTGG TGCTGGCGTT TGTTCCCGAT GCCCGATGA
|
Protein sequence | MEQISESDPQ FVHFQHGCPT LSREEQVVLF HALRNQGMTF ASNGTVAITN SVIHGPVAGI NFGTMQSIIQ MLPEPIDPLP AALAALAAIP LDHVPQPRSD LPHASRLPFE ASPHFVGRED ELKQLAAAIG TAQPAVVMPA VATGLGGIGK TSLVTEFAYR YGVYFHGGVF WLNCADPDQV ASQIAGCALA LGIDLTGMAL DEQVQRVLNA WKSPMPRLLI FDNCEDRAIL DQWKPTVGGC RVLITARSDQ WPTLTQIRLG LLSPTESRSL LQRLCARLTD AEADAIAEDL GHLPLALHLA GSYLNTYSHH TVEQYRKDLT IAHRSLKGRG ALPSPTRHEL DVEATFMFSF KQLDANDALD ALALGMLDGA AWCAPGIPIP RELVLAFVPD AR
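The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, and this sketch does not handle alternative starts). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Sketch: clean a block-formatted sequence, compute GC fraction, translate.
# Assumption: table 11 codon-to-amino-acid assignments equal the standard code.
# Alternative start codons (e.g. GTG, TTG read as M) are NOT handled here.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
prefix = "ATGGAACAGA TCAGCGAATC CGATCCCCAG"
print(translate(prefix))  # MEQISESDPQ
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.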