Gene Haur_4491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4491 
Symbol 
ID5736342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5751451 
End bp5752899 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content50% 
IMG OID641281654 
Producthypothetical protein 
Protein accessionYP_001547251 
Protein GI159901004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGAGG CACATTTATT TGGGTTTGGC GGAGCACTCT GGTTAGCCTT GTTTATTGCC 
ACCCGTACAC CCCAACGCTC GCTGCTGTTT TGGGCCAGCA TTATCAGTCT ATTTGGCTTG
ATGGGCTTTT TTGGCTCAGG TGTGCTCGGC GGCGAAGGGC TGAATCTTGA GCAACTGGTC
AGTTTAGAAC GCGGGTTTTG GTGGAGTTCG GTACTGCCAA TTACGACTTG GCTGATCGTA
TGTAGTTTGA TTCACCAAAC CTTGCAACCA AGTATTGCCA CGATCGTCAA GCCCGAACGC
TGGGTCGTCG GCGTTATTGG ATTAATCCTC ATTGGCTTGG GCAGTTCCAG CAATTGGTTG
ATGAATTATG CCGAGCCAGT GCTGCTTGCT AGTGGTAGTC AAATTATCGG CACAGGCCCA
GCCTACCCAA TCTATAGTGC CTATGTGATG GGCTGTGTCA GCGTGGCGCT TTGGCATTTG
GTGGCAAGTT GGCGCATTGC CGAAACAGGT ATGGCGCGGC GCAGTTTAGC TAGTTTGGTG
TTGGGGGCTT TAGGGTTCTT AATTGGCACA AGTAGTTTGT TGGCTCGCTT AATCAGCACG
GGTACATGGC CACTTTTCTA TGGGTATATG CCGATTTTTG CTGGCTTGTT GATTACGGGT
TTTGGGTTGG TGCGATTTGG CTTATTGCTC CAAGGCCAGA ATGTGCTGCG CGATTTGATT
TATAGTTTTT GCGAAATTAG CATCCTCGCC TTGATTTATT TAATTAGTGT GAATATTTTA
GATCTGCTGC GGCCTAGCCA ATTGGCCTTG CTTTTGGCGT GTGTGATCAT CAGCCACACA
GGGTTGGATC GTGGGCGACG TTGGCTTGAT CGTTTGTTCT TTTCGCGAGC TGAACAAGAG
GCTCGTAGCC AATCGCGCGA ATTTGCGCTT GCCTTGGCCT CAACTCCTAC GCCAACCCCC
GCTCCAGTAA TTGTTGATGC GAAGCCCGAT AAAGCCTGGA ACGATGCAGT GCGGCGAGCG
ATCAGCGGCT TGAAAAATCC AGTTCAATTA GCCCAAAATC CCTTGCTAAG CAGCGCTTTG
GTTAGCCATA GCGTGCAGAG TAAAGCGCTA GAGGATAATC GGCTGAATCG CAGTGCAATT
GCCCGCGAAA TATTATTGCA AGCAATCGAG CAACTGCGGC CTGATGCCAG CCAAGCCTTA
GGCAGTGGCG ATGCTTGGCG TTGGTATAAT GTGCTGTATC TGCCCTATGT GCGCGAAATC
AACCGCAAAA CTGCGATCGA TTGGCTACGG CGCGGCCTCA GTGACCCATT AATTGATGCG
AGTGTGTTAA GTTGGCTAGC TGATATTGAT GAAGATACCT TTTATAAGTG GCAGCGCCGC
GCCTCAGATT TGATCGCGGC TCAATTGTGG GAGCAACAGT TGAAGTTGGT GCAACCAGTT
ATTGCGTAA
 
Protein sequence
MVEAHLFGFG GALWLALFIA TRTPQRSLLF WASIISLFGL MGFFGSGVLG GEGLNLEQLV 
SLERGFWWSS VLPITTWLIV CSLIHQTLQP SIATIVKPER WVVGVIGLIL IGLGSSSNWL
MNYAEPVLLA SGSQIIGTGP AYPIYSAYVM GCVSVALWHL VASWRIAETG MARRSLASLV
LGALGFLIGT SSLLARLIST GTWPLFYGYM PIFAGLLITG FGLVRFGLLL QGQNVLRDLI
YSFCEISILA LIYLISVNIL DLLRPSQLAL LLACVIISHT GLDRGRRWLD RLFFSRAEQE
ARSQSREFAL ALASTPTPTP APVIVDAKPD KAWNDAVRRA ISGLKNPVQL AQNPLLSSAL
VSHSVQSKAL EDNRLNRSAI AREILLQAIE QLRPDASQAL GSGDAWRWYN VLYLPYVREI
NRKTAIDWLR RGLSDPLIDA SVLSWLADID EDTFYKWQRR ASDLIAAQLW EQQLKLVQPV
IA