Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1382 |
Symbol | |
ID | 5733274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1596197 |
End bp | 1597192 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641278520 |
Product | SPP1 family phage head morphogenesis protein |
Protein accession | YP_001544155 |
Protein GI | 159897908 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.400377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACG CGGTTGCCAT TGCCAATCAA TTCCGTGCTG AGCTTCAGGC GCAGAACGCG AGTGCGATGG ACGCGCTGAC CACACGCTGG CGCACGGTCG AAGCGGCGTT GCAGGCCGAA ATGGAAGCGT TGGCGTTTCA AATGAGTCAG ACCAAAGCTG ATGGGCAAAC GGTCAGTGAA TCGCAACTGT ATGCGAACGA GCGCTACCAA GCGTTGTTGG TGCAGCTCCA GCGGGAATTG GCCAAGTACA ACGCCGATGC AGCGGCCATC ATGCAAGCCC AGCAGTTGAG TTTTGCCAGT ATGGGCGCGG AGCAGGCCAC GGCGTGGTTG CGCTATTCCG GCGCGATTCA AGGGTCATTC ACGCAGTTGA ACAGCGGCGC ATTTGAAAAC ATTGCCGCGT TGGCACGGGC TGACAATCCC TTGGCGCTCT TGCTGAGCAA CGCCTATCCG GAAACGGCCC AAGCCATCAC CGATGCCCTG TTGAGTGGCC TTGCGTTAGG GAAGAATCCG CGCATCACGG CGCGGATGAT GGTCAATGAC GGCTTAGCAA CCAGTCTCAA TCATGCCTTG TTATTGACCC GTGATCAGTC GATTCGAGCA GCACGCTTGG CTGCGTTGCA GCAGTATCAA ACGAGTGGCT TAGCCGACGC TTACATGCGC ATCGCGGCAC GCCAGCGGCG TACCTGTCTG GCATGTTTGG CGCTCGATGG TACGGTCTAT CCAACCAATG TGATGATGCC GCTACACCCT CAGTGCAGAT GCACTATCGT GCCCATTTTA AGAGGCCGCG AGCCGCTGGC GATCCCAACG GGCAAGCAAT GGTTTCTGCA ACAGGACGCA GCAACCCAAC GATCAATGCT TGGTCCAGGG CGCTACGCCT TATGGCAAAA GGGTGTCTTT CAGTTTGAAG ATTTAGCAAC CGTCCACAGC GGTGGGGTGT GGGGTGCAAA CACCCAAGTC ACCACGGTCG AAGCATTACG GAGGATGGCG CAATGA
|
Protein sequence | MIDAVAIANQ FRAELQAQNA SAMDALTTRW RTVEAALQAE MEALAFQMSQ TKADGQTVSE SQLYANERYQ ALLVQLQREL AKYNADAAAI MQAQQLSFAS MGAEQATAWL RYSGAIQGSF TQLNSGAFEN IAALARADNP LALLLSNAYP ETAQAITDAL LSGLALGKNP RITARMMVND GLATSLNHAL LLTRDQSIRA ARLAALQQYQ TSGLADAYMR IAARQRRTCL ACLALDGTVY PTNVMMPLHP QCRCTIVPIL RGREPLAIPT GKQWFLQQDA ATQRSMLGPG RYALWQKGVF QFEDLATVHS GGVWGANTQV TTVEALRRMA Q
|
| |