Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2515 |
Symbol | |
ID | 5734393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3212254 |
End bp | 3213579 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279655 |
Product | hypothetical protein |
Protein accession | YP_001545281 |
Protein GI | 159899034 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACGT GGTGGAAAAT CCGCAGTTTC GGCGTGCTCG TACTTTTGGG TTTAGTTTTT GGCTTGGCTC CACAACCTTC ACAGGCAGGC GGGGTTGTCT ACGTTGTGCC AGGTGGTGCT GGTAGCCAAA CTGGCACCGA TTGGGCCAAC GCCAAAGATC TTCAGGAAGC GATCCAAAGC TCGACGACCG GCACAGAAAT TTGGGTCAAG CAGGGAATTT ATCGCCCAAC AGCTGGGACA GATCGCACAG CCGCCTTTAC ACTTACCAGT GGCATTCAAC TTTTTGGTGG CTTTGCTGGT GATGAAACCT CGCGCCAAGC CCGGAATCCC AGCCAAAATC CCACAATTCT CAGTGGCGAT TTGCTTGGCA ACGACGTAGG TGCAGCCAGT CAAAGTAACC CAACCCGCAG TGATAACAGT TACCACGTGC TTTATTGTAT TACAAAGCCT GAGCCAATGT TAATCGATGG CTTTGTGATT AGTGGTGGAC AAGCGACAAA CCAAATATTT CCACAGAATA TGGGCGGTGG TGGTCTATTC GATCGGTGCA ACTTAACAAT ATTAAATAGT CGTTTCATCA GCAATGCTGG CCTGTTTGGC GGTGGGATTT TGAATTTCAA TACTAGCTTA ACGATTAATC AAAGTATCTT TGCTGGCAAT TGGGCACTAA GCACTGGCGG CGGGCTTGAA AACCAACAAA ATGGGATTGT TAACCTGCGA TCAAGCGCAT TTGTCGGCAA TATTTCCCAG CAAAATACCG TGAGTGCGAT TGCCAACGCT AGTGGCACAC TTAATCTGCA TAACAGTATC GTTTGGGAAA ATAATGGCAT CGCGCCCATC AACGGTTCCA ATAATAATCT CAACCTCAAC GTGAGTTATT CAATTGTTCA AGGCGGCTTT ATTGGCACTG GCAATAGCAG CAGCGATCCG CAGTTTGTTG ATGCTGATGG TGCTGATAAT CTGGTTGGCA CGCTCGATGA TGATTTGAAT TTGCAGGCCA ACTCAGCCGC CCGCAATACG GGCAATCCGA GTTTACTGCC AAGTGATCAA ACTGATCTCG ACGGCGATAA CGATGTTAGC GAGGCTATTC CCTTGACAAT CGATTATCGA CCACGGATCA ACGAAGGTTT CGTTGATATG GGTCCATACG AATATCAAGG CAATGAGCCA ACCATAACTC CGACGGCAAC GGCTACACCC AGCCCAACTG TTACAGCAAC CGCTACAGCA ACAGCAACCG TTATGCCTAG CGAAACCCCA ACTCCAACCG CAACGGCTGA AATTCCCCCA ACAACCCAGA TCTATCTACC AGCGGTCATG CGCTAG
|
Protein sequence | MNTWWKIRSF GVLVLLGLVF GLAPQPSQAG GVVYVVPGGA GSQTGTDWAN AKDLQEAIQS STTGTEIWVK QGIYRPTAGT DRTAAFTLTS GIQLFGGFAG DETSRQARNP SQNPTILSGD LLGNDVGAAS QSNPTRSDNS YHVLYCITKP EPMLIDGFVI SGGQATNQIF PQNMGGGGLF DRCNLTILNS RFISNAGLFG GGILNFNTSL TINQSIFAGN WALSTGGGLE NQQNGIVNLR SSAFVGNISQ QNTVSAIANA SGTLNLHNSI VWENNGIAPI NGSNNNLNLN VSYSIVQGGF IGTGNSSSDP QFVDADGADN LVGTLDDDLN LQANSAARNT GNPSLLPSDQ TDLDGDNDVS EAIPLTIDYR PRINEGFVDM GPYEYQGNEP TITPTATATP SPTVTATATA TATVMPSETP TPTATAEIPP TTQIYLPAVM R
|
| |