Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0801 |
Symbol | |
ID | 5732701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 905439 |
End bp | 906626 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277932 |
Product | hypothetical protein |
Protein accession | YP_001543577 |
Protein GI | 159897330 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGGC GACGAGCAGG CTTATTTGTT TTGGTCTTGC TCTTGATCAT TGGCTTGCTA AGCGATGTAA CGTTGTTGCC GTTGGTCGCG ATGCTGGGGA TTATGGCCTT GCTCGCGGTC GAATTATGGG AATGGCGCAT GTTTAAGCAT GTTGATTATC AACGTGAACT AGGCCATACC CATCTGTTCC CCGACAACCG CACAACCCTT AGCATTACCC TGCGCAATCG CAAATTTTTG CCCTTGCCAT TTGTTAATTT GCACGATTTG GTTCCAGTAG GCATCACGCT TGAGCAGATT GAAACCCAAC CTGCTGCTAG CCCCAATTAT CGGGTGCTAG CGCGGGCATT TGGCATCAGC AGTTATCAGC AAGTAACCCG CCAATATACA ATTTTGTGCC CACAGCGCGG TTTGCACCGT TTTGGCCCAG CCAATTTGAG TGCTAGCGAT CCGCTTGGGC TGAGTATTAG TCGCGCAACA ATTAATGAGA TTGATCGCTT GATCGTCTAC CCTCGCTTGT TAACTGAGCC AGAATTAGGC TTGCCGTTAC GCGAGTTATT GGGCACGATT CGCGCCTCGC AGCGCTTATT GACCGACCCT GTTGTGCCGA TTGGTATCCG CGATTATACC CAAAGCGACC CGCTCAAAAG CATTCACTGG ACGGCGACAG CGCGGCGCGG CCAATTACAA ACTCGCATTT ATGAGCCAGT TACGGCGCTG ACCGTGATGT GTATTCTTGA TATCGAAACG ATTGTGCCAT CCTATCTTGG GGTGAATAAA TTTCAGGGCG AACGTTTAAT TAGTATGGCG GCAACGGTTT GTAGCGCTTT ACACAAAGCT GGCCATGCGA TTGGTTTATG GTCGAATGCC GCGCTGGTTG AGGGCAACAC GGCCATTCAG CTGCCGCCCA ATCGTAGCCC CAAACAGGCC AGCGCCATCT TGGAAGTATT GGCCCAAATG TCGCTCTACT CGCGGCTAGA AATTGCCAAA TTTATTGGGC GTGAACAATC ACGCTTACCG CTAGGCGCGA CGGTTTTGCT GATTAGTGCG GTGGATACGC CGGCTCATCG CAGCGCCTTG GCCCGTTTAC GCGAATATGG CTATGCCCCC GTTTGGCTCT ATTTAGGCCA GCATGCACCA AAAGTTGCTG GAGTTAAGTT GATTCATAGT CACCAGAGGG AGCCATGA
|
Protein sequence | MKGRRAGLFV LVLLLIIGLL SDVTLLPLVA MLGIMALLAV ELWEWRMFKH VDYQRELGHT HLFPDNRTTL SITLRNRKFL PLPFVNLHDL VPVGITLEQI ETQPAASPNY RVLARAFGIS SYQQVTRQYT ILCPQRGLHR FGPANLSASD PLGLSISRAT INEIDRLIVY PRLLTEPELG LPLRELLGTI RASQRLLTDP VVPIGIRDYT QSDPLKSIHW TATARRGQLQ TRIYEPVTAL TVMCILDIET IVPSYLGVNK FQGERLISMA ATVCSALHKA GHAIGLWSNA ALVEGNTAIQ LPPNRSPKQA SAILEVLAQM SLYSRLEIAK FIGREQSRLP LGATVLLISA VDTPAHRSAL ARLREYGYAP VWLYLGQHAP KVAGVKLIHS HQREP
|
| |