Gene Haur_0595 details


Gene Information

Locus tag: Haur_0595
Symbol:
ID: 5732493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 687290
End bp: 688465
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 48%
IMG OID: 641277722
Product: hypothetical protein
Protein accession: YP_001543371
Protein GI: 159897124
COG category:
COG ID:
TIGRFAM ID:
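
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(687290, 688465)
print(length, protein_length(length))  # 1176 391
```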


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAC ACCAGTCTGA GCAGGATGTG GGCGTGGATA GTCCAGCTAC GCCATCACTG 
CTTCAACGGA ATCGTTGGTT GCGGCGATTG ATGAAATTTG GCTTTGTGGT GGTTGGCCTG
TTTTTCTTTA TTTTAGCTTT GCGCTTATTA AGTAAAGGTG CAGGTGGCTT GGGGCCATTT
TTAACCAAAA CCTTGGGGAT TGAAAATGCC CTGAATACCT TGGGCTTTGG GTGGCTCTTT
GCCTATGGGG TATTGAGTGG CTCGCCCGTG GCCGGAATTG CGCTTTCTTT TTTAGATAGC
AAAGTAATCG ATCCACTTCA GGCTTTTACG ATGATTACTG GCTCACGGCT GGGTGCATCG
TTTATTGTCT TGGTGATTGG CTTTATCTAC TTTTTGCGGG GCCGTGAAAA AGCCGCCTCG
CTTTCAATTG GCGTGTTGGC GCTTAGCGTT ACAGCTACCA CCTATCTGCC AGCATTGGCG
ATTGGCTATT GGCTCTTGAC TGATAGCGGG CTGGATCAGG TGCGGATCGC CTTGCCATCG
GCAATTTTCG ATTTCGTCGA GCAGGTGTTT GATCCGATTG TGGCTTGGCT GATTCGCACG
ATTGATAGTG GGATTGTGAA TATATTTGGG ACAACTCCCG CGACAACTGG GGTTTTACCG
GCAACGAGTG TGGTAATTTT TGTGGTTGGG GTCGGCACAT TGCTGTTTGC CTTTAACCTT
TTGGATAAAG CATTGCCGCA GGTTGATGCT GAACATAATG CATTTGGGCG GGTTGGTGGC
TTGATCTATC GACCGTGGGC GATGTTTTTG CTGGGAGCGT TCGTTACCTC GTTGACCTTA
TCGGTTTCGG TTTCGCTCTC GATCTTGGTG CCACTTTCGG CTCGCGGCTT TATTCGGCGC
GAAAATACCT TGCCCTACAT TATGGGAGCC AACATCACCA CCTTTATTGA TACCCTGATT
GCCTCATTGT TGATCAAAGA TCCATTTGCT TTTACAGTAG TGCTGGCTGA AATTATCAGC
ATTACAGTAA TTTCATTAAT TATTCTCGTT TTTCTCTATC GCCCATATGA GCGGCTAATT
TTGCGCTTGC TTGATCGGGT AGTCAACGAT ACGCCGATGT TGGTTACCTT TATGGTGGTG
ATGGTCGTGA CCCCGATTCT GCTGTTATTT ATCTAG
 
Protein sequence
MSEHQSEQDV GVDSPATPSL LQRNRWLRRL MKFGFVVVGL FFFILALRLL SKGAGGLGPF 
LTKTLGIENA LNTLGFGWLF AYGVLSGSPV AGIALSFLDS KVIDPLQAFT MITGSRLGAS
FIVLVIGFIY FLRGREKAAS LSIGVLALSV TATTYLPALA IGYWLLTDSG LDQVRIALPS
AIFDFVEQVF DPIVAWLIRT IDSGIVNIFG TTPATTGVLP ATSVVIFVVG VGTLLFAFNL
LDKALPQVDA EHNAFGRVGG LIYRPWAMFL LGAFVTSLTL SVSVSLSILV PLSARGFIRR
ENTLPYIMGA NITTFIDTLI ASLLIKDPFA FTVVLAEIIS ITVISLIILV FLYRPYERLI
LRLLDRVVND TPMLVTFMVV MVVTPILLLF I
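
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares for internal codons), it does not model the alternative start codons that table 11 permits (not needed here, since this gene begins with ATG), and it strips the spacing used in the 10-base display blocks. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCGAAC ACCAGTCTGA GCAGGATGTG"))  # MSEHQSEQDV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.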