Gene Haur_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0785 
Symbol 
ID5732669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp885055 
End bp886548 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content49% 
IMG OID641277915 
Producthypothetical protein 
Protein accessionYP_001543561 
Protein GI159897314 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTTG GTAGCTTATG GCTGGTTGCC CCAGAGGATT TTTGGTGGCA GCTCAAAATT 
GGCGATCTGA TTCGCACAAG TGGCAGAATC CCCACTGTTG GCGTTTTTTC CGCCACCCAA
GCCAATACAG CTTTTTTCTA CCAAAACTGG CTCAGCCAAT TCCTCTTTTC GTGGATCTAT
CAGCTTGGTG GTTTAGTTGC AATTTTACAA ATCCGCAGCC TTTTATTAAT CGGCAGTTAT
GCTTTATTGC TCTGGCATAC CTGGCGACGG GTAAAAGCCA ATGGGCGGGC CGCCATGCTC
GGTTTGTTGT TGAGTGTGCT GGTAAGTTTC AATCATTGGC AAGTTCAGCC TGCGATGTTT
GTATGGCCGT TGTTTATCGC TAGTTTTGTG ATTGTTAGCG AAGTTGCTGC CGAGCGCTGG
ACAACGAAAT ATCTCTGGTT GCTCCCGATC ATCCAATTAC TTTGGGTCAA CCTGCATGAA
AGTTTCATCT TTGGGCCAAT CCTGGTTGCA ACAGCCGCTG TAGGCGCAAT CATCGATCGA
CGGCGCGACC ATGATGAAGC ACCAATTTAT GCAACTGCCC GAGCGCTGCA AATCGCCACG
GCAACGACGA CCATCGCGAG TTTTATCAAC CCGCATGGCT GGAATGGCTG GATCGCCGCA
TGGCAACAAT TAACCAGCAT AGTTCCCGAA GCTTTACAAA CTCAATCAGG CTCACCACTG
CTGAATTTTG CTACTCCGAT GGCTCAAGTG AGCTTGGCGG TTGGTTTAAT TGCAGCCATG
ATGTTATCGG TAGTTTGGCA GCGCATGCGC AGCGCCGATC TGATCATAAC GGCAATCATG
GCAGCCTTTA GTTTACTCAG CATGCGTTAT CAATTTTGGT TTGGTAGTGT GGCAGGCCCA
ATTATTGCCG AGGCAATTGT GCGCCGTGGC CGCTTACGCC TGATTAAACG TAACCCCAGC
GCACCGATAT GGATTGCCGG ATTAACAATC ACGATTGGTT TGATTGGGCT GCTTATGCAA
CCAATTATTC GTATTTGGCT GCCTTTACCA GCGGCTTTGC AGGGTGCAAC TGGCAATTTG
CCGCAAGCAA CATTAGCGAG TGCTGCTACC CCAATTCAAG CAGTCGAGTT TTTACAGGCC
AATCCACCAA GCCAAGCCTA CTTCCATGAT CTTGGCTATG GCAGCTATTT AATCTGGCAA
GCTGGTGAGC AATTGCCTGT ATTTATCGAT CCGCGAGTTA GCTTATACCC AACCGAACAT
TGGCAAGCCT ATAGTTGTAT TATGGCAGGC CGCGATTGGG AACGGCTGCT AACTCAAGAT
TCAATTGATA CAATATTAGT GGATCGCGGA AACGGCCAAC AATTGATCAG CGCAGTCCAA
GCCAATTCAG CTTGGCGCGA GGTTTATGCC GATCAACAAA GCCTGATTTT CAAGCGTGAT
CCGCAAGCAG CCCAACCAAC TGGCTCAGCC ACGAGCTGTC CAGCAACTAA GTAG
 
Protein sequence
MAVGSLWLVA PEDFWWQLKI GDLIRTSGRI PTVGVFSATQ ANTAFFYQNW LSQFLFSWIY 
QLGGLVAILQ IRSLLLIGSY ALLLWHTWRR VKANGRAAML GLLLSVLVSF NHWQVQPAMF
VWPLFIASFV IVSEVAAERW TTKYLWLLPI IQLLWVNLHE SFIFGPILVA TAAVGAIIDR
RRDHDEAPIY ATARALQIAT ATTTIASFIN PHGWNGWIAA WQQLTSIVPE ALQTQSGSPL
LNFATPMAQV SLAVGLIAAM MLSVVWQRMR SADLIITAIM AAFSLLSMRY QFWFGSVAGP
IIAEAIVRRG RLRLIKRNPS APIWIAGLTI TIGLIGLLMQ PIIRIWLPLP AALQGATGNL
PQATLASAAT PIQAVEFLQA NPPSQAYFHD LGYGSYLIWQ AGEQLPVFID PRVSLYPTEH
WQAYSCIMAG RDWERLLTQD SIDTILVDRG NGQQLISAVQ ANSAWREVYA DQQSLIFKRD
PQAAQPTGSA TSCPATK