Gene Haur_1382 details

Gene Information

Locus tag: Haur_1382
Symbol:
ID: 5733274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1596197
End bp: 1597192
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 57%
IMG OID: 641278520
Product: SPP1 family phage head morphogenesis protein
Protein accession: YP_001544155
Protein GI: 159897908
COG category:
COG ID:
TIGRFAM ID: [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family
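
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1596197
end_bp = 1597192
reported_gene_length_bp = 996
reported_protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
computed_protein_length_aa = computed_gene_length_bp // 3 - 1

print("gene length matches:", computed_gene_length_bp == reported_gene_length_bp)
print("protein length matches:", computed_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.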


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.400377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGACG CGGTTGCCAT TGCCAATCAA TTCCGTGCTG AGCTTCAGGC GCAGAACGCG 
AGTGCGATGG ACGCGCTGAC CACACGCTGG CGCACGGTCG AAGCGGCGTT GCAGGCCGAA
ATGGAAGCGT TGGCGTTTCA AATGAGTCAG ACCAAAGCTG ATGGGCAAAC GGTCAGTGAA
TCGCAACTGT ATGCGAACGA GCGCTACCAA GCGTTGTTGG TGCAGCTCCA GCGGGAATTG
GCCAAGTACA ACGCCGATGC AGCGGCCATC ATGCAAGCCC AGCAGTTGAG TTTTGCCAGT
ATGGGCGCGG AGCAGGCCAC GGCGTGGTTG CGCTATTCCG GCGCGATTCA AGGGTCATTC
ACGCAGTTGA ACAGCGGCGC ATTTGAAAAC ATTGCCGCGT TGGCACGGGC TGACAATCCC
TTGGCGCTCT TGCTGAGCAA CGCCTATCCG GAAACGGCCC AAGCCATCAC CGATGCCCTG
TTGAGTGGCC TTGCGTTAGG GAAGAATCCG CGCATCACGG CGCGGATGAT GGTCAATGAC
GGCTTAGCAA CCAGTCTCAA TCATGCCTTG TTATTGACCC GTGATCAGTC GATTCGAGCA
GCACGCTTGG CTGCGTTGCA GCAGTATCAA ACGAGTGGCT TAGCCGACGC TTACATGCGC
ATCGCGGCAC GCCAGCGGCG TACCTGTCTG GCATGTTTGG CGCTCGATGG TACGGTCTAT
CCAACCAATG TGATGATGCC GCTACACCCT CAGTGCAGAT GCACTATCGT GCCCATTTTA
AGAGGCCGCG AGCCGCTGGC GATCCCAACG GGCAAGCAAT GGTTTCTGCA ACAGGACGCA
GCAACCCAAC GATCAATGCT TGGTCCAGGG CGCTACGCCT TATGGCAAAA GGGTGTCTTT
CAGTTTGAAG ATTTAGCAAC CGTCCACAGC GGTGGGGTGT GGGGTGCAAA CACCCAAGTC
ACCACGGTCG AAGCATTACG GAGGATGGCG CAATGA
 
Protein sequence
MIDAVAIANQ FRAELQAQNA SAMDALTTRW RTVEAALQAE MEALAFQMSQ TKADGQTVSE 
SQLYANERYQ ALLVQLQREL AKYNADAAAI MQAQQLSFAS MGAEQATAWL RYSGAIQGSF
TQLNSGAFEN IAALARADNP LALLLSNAYP ETAQAITDAL LSGLALGKNP RITARMMVND
GLATSLNHAL LLTRDQSIRA ARLAALQQYQ TSGLADAYMR IAARQRRTCL ACLALDGTVY
PTNVMMPLHP QCRCTIVPIL RGREPLAIPT GKQWFLQQDA ATQRSMLGPG RYALWQKGVF
QFEDLATVHS GGVWGANTQV TTVEALRRMA Q