Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3940 |
Symbol | |
ID | 5735801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4936064 |
End bp | 4937608 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281091 |
Product | hypothetical protein |
Protein accession | YP_001546702 |
Protein GI | 159900455 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.405869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGC CGCCGTTGTG GCATCTCTCC CTGAATCGTA GCAGCGTTGC TACCACCAAG CTGCATTCCC AGATGCCCAG TTTTGCATCC ACCGCGTACC TCCGGAGGTT TCTGTTGCGA CACATTGCCC GTTTCTTGGC GCTTGGTTGT CTGCTCGCCG TGAGCATGAC CCCCATCCGA GCGCAGGCGA ACCCAACCTT GTGGGCTGCT GCTGCATCAA CCGACACCCT CGTTCTTGCT TCTGGTCAAC TGACCCTCGC CCCGAATGCG GCCCAAGCGG CTGCGGTCTA TGGCTCATAC GCTCGTTTTG GAATGTTCGA TAGTGCTCCG CAAAACATCG CTCCCGCCAA TCAAGTGCTG GTTACATGGG GCGCAACTGT GCCCGCTGCC GCTAGCGTGC GCGTCGATGT GCGTGGCTTC AATGGCCAAC GCTGGAGCGA TTGGACGCTT GATGTGCAAT CGGGCCAAAC GGTAGCTTTT GCCACCATCG CCCGCCAAAT TCAATATCGT TTGGTGCTAT TGGCCAACGA GGCTGCGCCA GTCGTTGATT TTGTGCAACT TGCGCCCAAC ACGCTTGCCG AAAGCGATGC CATCAGCATT ATGGAAGATG AGCCGATTGC TCCAACCTAC CATATTCGGG CTACCCGGAT GGGCTTGGTT GGCGATCGCA CGGCCAACGG CCATATCATT CAGCCAAACG ATTGGTTTGT TTCATTGCCA TCGTTCCGCT CACTCTCATC GCGTGGCGGC GGCGAATACA TGGCGCGGCT TTCCTATCGT GGCAAATCGA TTGTTGTGCC AGTTTGGGAA GTTGGGCCAT GGAACATTCA CGATGATTAT TGGAATGTTG AGCGCGAGAA ATTTGGCGAT TTGCCTGTCG GCTGGCCCCA AGATCACGCT GCCTATTTCG ATGGCTACAA TGGTGGCTGG GCTGAAAAAG GCCGCGTGCG ATTCCCCACC GCTGCTGATG TCGGCGATGG CGCATGGGTC GCCTTGGGCA TTCCATTTAA CGATGAACAA GAAGAACTTG ATATTACCTT CTTGTGGCTA GGCCGTGATC CTGGCGATAA CCCCGACCCA ATGCCAGTTG GCAGTGTTAC GCCTGAGCCA GCCCCAATTG AAGAATTACC AGCGGGCACG ATTCAGGTCG ATAATCAAGG CGAACAATTC AGCCGCTCCG ATGTGGCATG GTTTGAATTT TCGTGTGGTA AAAATCGCCA TTCGTTCTGG ACCTTCTCAA CCAACAAGCC TGAAGAAGCA GTCAATAATG CGCGTTGGAC AACTCCGCTT GAGGCTGGTG ACTATAGCGT GACCGTGTTT GTGCCCTACT GCCCCAATGG CAAGAGCGAT ACAACTTCAG CACGTTATGT TGTGCAACAT GCCGATGGCG AAACTCAAGT TGTCGTCAAT CAAGCGGAAC ATGCTGGCAA CTGGGTTGAG CTAGGCCGCT ATCGCTTTGA TGGTACTGGC ACAGTTAGCC TCAGCGATTT GGCCGACGAC CGCATGAAAG CCATTTGGTT TGATAGCGTG CGCTGGACAA AATAA
|
Protein sequence | MPMPPLWHLS LNRSSVATTK LHSQMPSFAS TAYLRRFLLR HIARFLALGC LLAVSMTPIR AQANPTLWAA AASTDTLVLA SGQLTLAPNA AQAAAVYGSY ARFGMFDSAP QNIAPANQVL VTWGATVPAA ASVRVDVRGF NGQRWSDWTL DVQSGQTVAF ATIARQIQYR LVLLANEAAP VVDFVQLAPN TLAESDAISI MEDEPIAPTY HIRATRMGLV GDRTANGHII QPNDWFVSLP SFRSLSSRGG GEYMARLSYR GKSIVVPVWE VGPWNIHDDY WNVEREKFGD LPVGWPQDHA AYFDGYNGGW AEKGRVRFPT AADVGDGAWV ALGIPFNDEQ EELDITFLWL GRDPGDNPDP MPVGSVTPEP APIEELPAGT IQVDNQGEQF SRSDVAWFEF SCGKNRHSFW TFSTNKPEEA VNNARWTTPL EAGDYSVTVF VPYCPNGKSD TTSARYVVQH ADGETQVVVN QAEHAGNWVE LGRYRFDGTG TVSLSDLADD RMKAIWFDSV RWTK
|
| |