Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0040 |
Symbol | |
ID | 5731912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 49590 |
End bp | 51005 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277161 |
Product | hypothetical protein |
Protein accession | YP_001542820 |
Protein GI | 159896573 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATCATT TGCCAACGTT TCGTACATTA GGATTGCTTG GTTTGTTGGG GTTATTGTTG GTTGCCATAC AACCGAGCCA AGCCCAACAA CCACTATCAC CAAAAATTAG GCCCGAGCCT AGCCTGAGCA ATCGGCAGCC ATTGCTTAGT GCAGCCCCGC AAGGTGGCAG TGAATGGCTG GTTCCTTGTG CTGCCAACGC TGAAAATTGG CAAAGTAGTG TGCGTGAAGA TACAGCTGCG CTCAATTGCG AAATTTATGT GCCCGAATCG GCCATGATTT TTGTGATGGC GACCGGCAGT TTTAGCATGC AAACCAATGC TGTAACCAAT ACCTACGAAG GCCGAATTGG CCTAACCCTG GATGGAGTTT TGCGCGAGAG TAGCAATCGT TGGGGCAATG TTTATGCTTT AGGTCAACTC AATACTGGCG AAACCAGCAT TTTTGCCAGC ACCACGGTCT TTACGGCCAG TGCCGGGGTG CACACAGTGA GTTTAGTTGG GGGCTTTGTT GGTTATGGCC CGCTGACCTT GAACAACGCT CAACTAAGTG TTTTGGCCTT CCCAACCAAC TCGGCTAATA TTCGGGTTTG CGCCACCGAT ACGGGCTTAG GGGTCTGGCG AGCAACGCCA ACCATGAGCA CCATTCGTTC GTGCAGCTTC AACTTGCCAA GTAATAGCAC GGTATTTGTC AGCGCCGATG GTTCAGCTTT GCCAATTCCT GGCAACGAGG TAGCCTTGCA ATTTCGCTTG GGCGTTGATG AGGCTACCAC TGGCGATGTA CGAACTGATC GCTATGTTGA TGTTGATAGC TTAGAACCAA ATAATGAGCA AATCGATGGC AATGATATTT CGACTTCGAT TGCCGCTAGC TTCAACCTAA CTGCTGGCAA CCACACCATT AATTTCTTGG GAAATGGCAG TGGCAGCAAT CAGGCCTATC TGTCGCGCTC AAGTTTGGCA GTGTTGGCCT TCCCGAGTGG TAGCCCATTC CGTACCTGCA CCAGCATGAA CGATACTAGT ACCTTTTTCA GCAATAGCGA ATTTAGCTCA TGGGCCAATT GTCTGTTGAC CGTCTCTGGT GCTCATCGCG GGATTATCGT GGGCAATGTG ACAGTTGGTC AGCAAAATGG CGAGCCGCAA GTCCGCACCC GTCTGCGAGC CAATACCGAG GTTGTGCTGG GTAGCACGCG CACCAGCGAT TTGACGATCT TCCGTATGGT TGGCGGTCAA GGCGATGACA AGACTATGAC TTCGGTTGGC ATGAGCGAAT TGGCTGGCGG CCTGAATATC TTCAATTTTG ATGGCTATCC ATCCAACAGC AGCACCGTGC GCATGATCGA TCCTAACATT CATGTACTAG CATTTCCCGA CCCATTCAGG TATAAACAAT ACACGCCATT AGCGCTTGGC GAATAA
|
Protein sequence | MHHLPTFRTL GLLGLLGLLL VAIQPSQAQQ PLSPKIRPEP SLSNRQPLLS AAPQGGSEWL VPCAANAENW QSSVREDTAA LNCEIYVPES AMIFVMATGS FSMQTNAVTN TYEGRIGLTL DGVLRESSNR WGNVYALGQL NTGETSIFAS TTVFTASAGV HTVSLVGGFV GYGPLTLNNA QLSVLAFPTN SANIRVCATD TGLGVWRATP TMSTIRSCSF NLPSNSTVFV SADGSALPIP GNEVALQFRL GVDEATTGDV RTDRYVDVDS LEPNNEQIDG NDISTSIAAS FNLTAGNHTI NFLGNGSGSN QAYLSRSSLA VLAFPSGSPF RTCTSMNDTS TFFSNSEFSS WANCLLTVSG AHRGIIVGNV TVGQQNGEPQ VRTRLRANTE VVLGSTRTSD LTIFRMVGGQ GDDKTMTSVG MSELAGGLNI FNFDGYPSNS STVRMIDPNI HVLAFPDPFR YKQYTPLALG E
|
| |