Gene Haur_0077 details


Gene Information

Locus tag: Haur_0077
Symbol:
ID: 5731950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 100187
End bp: 101365
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 54%
IMG OID: 641277199
Product: hypothetical protein
Protein accession: YP_001542857
Protein GI: 159896610
COG category:
COG ID:
TIGRFAM ID:
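
The length fields above follow from the coordinates. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (an assumption, though one that reproduces the stated 1179 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(100187, 101365)
print(length)                  # 1179
print(protein_length(length))  # 392
```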


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATGG AAGGGTTTCG TTGGTTGTTG AGCGACGCTG GCCAACAACT CTTGGCTGAA 
CTCAGCAATG ATCGCGATTT GAATGAAGCT AACTTTTTGC GCTACACCAC CAAATTGCGC
AAACACTACC CAGCCGAGGC GGTAACGGCG GCTTTGGAAA CCAACTTATT GCGCCGCGCC
GCCCAAGCTA AATTTCCCCA AGCCAGCCAA CTATATTTTA CCCGTGAGGC CTTGGAGCAA
GCTACGCCTT GGCTGGTTGC TAGCTATCGC CAACGCCATT TTGCGACTGG CTCGCGGCTG
GTCGATTTGG GTTGCTCGGT TGGCGGCGAT GCCTTGGCCT TAGCGCAAAG TTGCTCGGTT
TTGGCGATCG ATCGTGATCC ATTGCGGTTG GCAATGCTTG AGGCCAATGC TCAGGCGCTT
GGGCTAAGCC AGCAAATCAG CATCCAAGAG GCCGATTTTA CCACGCTAGA ATTTGCGGGC
TACGCTGGTT TATTTATCGA TCCGGCGCGG CGCAGCAATG GCAAGCGCAT TTGGGATGTT
GAGCACTATC AGCCGCCGCT TTCTACCTTG GAGCGTTGGC GTGGGCAAGT GCCAATCCAT
GGAGCTAAAG TTGCCCCAGG TATTCCCGAT GATGCCGTGC CTGCTGGCTA TGATCTTGAG
TTTATTTCGC TTGATGGCGA TTTGCGCGAG GCCTGTTTGT GGTGGCAAGC TGGTCAGGTT
GGCGGGCAAC GCAAGGCAGT GGTGCTCACT AGTGCTGGTG CTGAACACAG CTTAATTGCC
GATTCCACCC AAGCCGCCGC TGCACTCAGC GAGCCACTGG CCTATTTGTA CGAGCCTGAT
CCAGCGGTGA TTCGAGCGCA CGCCGTGGCA GATATTGCCA ATCAGTTGGA TTTAGCTCAA
TTTGATGCCA GCATTGCCTA CCTTACCAGT GATCGCTTGG TACAATCGCC ATTTCTGCGA
GCTTGGCAAA TTGAGCAATG GCTACCGTTT AATTTGAAAC TTCTGCGCCA AATATTGCAA
GCGCGTGAGA TTGGCCGCGT AACCGTCAAA AAGCGTGGCT CGCCGATTAC CCCCGAAGAA
TTAAGCAAAC AACTGCGCTT GAAGGGTCGC TACGAGCAAA CGTTGGTGCT GACCAAACTA
CAAGGCCAGC CAGTCGTGCT GTTGGTAAAA TTGCTTTAA
 
Protein sequence
MEMEGFRWLL SDAGQQLLAE LSNDRDLNEA NFLRYTTKLR KHYPAEAVTA ALETNLLRRA 
AQAKFPQASQ LYFTREALEQ ATPWLVASYR QRHFATGSRL VDLGCSVGGD ALALAQSCSV
LAIDRDPLRL AMLEANAQAL GLSQQISIQE ADFTTLEFAG YAGLFIDPAR RSNGKRIWDV
EHYQPPLSTL ERWRGQVPIH GAKVAPGIPD DAVPAGYDLE FISLDGDLRE ACLWWQAGQV
GGQRKAVVLT SAGAEHSLIA DSTQAAAALS EPLAYLYEPD PAVIRAHAVA DIANQLDLAQ
FDASIAYLTS DRLVQSPFLR AWQIEQWLPF NLKLLRQILQ AREIGRVTVK KRGSPITPEE
LSKQLRLKGR YEQTLVLTKL QGQPVVLLVK LL
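
The gene and protein sequences can be checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGAGATGG AAGGGTTTCG TTGGTTGTTG"))  # MEMEGFRWLL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and the GC content field.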