Gene Haur_5036 details


Gene Information

Locus tag: Haur_5036
Symbol:
ID: 5736995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 49805
End bp: 51475
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 56%
IMG OID: 641282203
Product: hypothetical protein
Protein accession: YP_001547794
Protein GI: 159901548
COG category:
COG ID:
TIGRFAM ID:
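
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the terminal stop codon (the record lists the gene as not spliced, and the gene sequence ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length_bp(49805, 51475)
print(length, protein_length_aa(length))  # prints "1671 556"
```

Both values match the Gene Length and Protein Length fields of the record.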


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0701972
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCTA CCTGGAAATA CGGTCGTATC CTTGGGCATG GAGTCATTGG CAGGCTGTTC 
GGCATGATCC TGCTAAGAAT TTCTGAACGC GAGGCATCGA TCATTCTTCA TCGAATCCAA
GCCGCGCTCT ACTCTAGCAG ACCGTTCTAT AGTATCCACC ATCGCTACAA TGCTGCAACG
GATTGGCTGT TCGCATACTT TTTTCCGGAC AACCTTATGG GTGACCTCCT CATTATCTTT
CTGAATCGAA GGAGACCTAT GGCTTTCATT ACATCTTCAG ATCTTTGTTT GGTGCTGATT
CATCGTTCAC TCCGTGGCCG ACTCTTGGTT TGGCAGGCAG CGATTATGGT GGCTATGGCG
CTGCTCCCTG GGAATAGCCT CGCCGCTAGC CCGAATCAAA GCTACGAGGT TGGCCCTGGG
CGGACATACG CCCGTCTTTC CGATCTGGTC AGCGCCGACG TGCTCGGCCC TGGCGATACA
GTGCTGGTCT ACCCGAATGG GACGGCATCC TACAATGACA CTGTGATCTT CGACACCCAT
GGCACGGCTG ATCGCCGGAT CACGATCCGC GGGGTGCGAG TCAACGGACA GCGCCCTATC
CTGTCCAGCA ACAACAACTA TGGAATCGTC TTTAAAGGTG ACCATTATAT CTTTGAGGGA
TTTGAAGTCA CGGGTGCCGT CGGGAACCAG TATGTGATCA TCCACCGTGC AGACCACATC
CTCATCCGTG ATACACTCAT CCGTGACTGC CCCGGTACAG GACTGCTCGG CCACGACGAG
GACGCAGGCT CCCTTACGCT CGATCACGTT GAGGTAACCA ACTGCGGCAA CGGCCTCTAC
CAGCATCCCA TTTATATGAC AACTGGCTTG CCCGGCGCGG TGTTTCGAAT GCAATACTGC
TACCTCCACA ACCAGAAAGG CGGCAACGGT GTCAAGAGCC GTGCCAACCG CAACGAGATC
TACTATAACT GGATCGAGGG CAGCTACTAC CACGAGCTGG AGATGATCGG TCCCGACGAT
GGCACGGGCG GCTCGCCCGA CTCGCCGCGC CACTCGGATG TGGTCGGCAA TGTCTTTATC
AAGAAGCAGG ACTTTGCGAC GCTCGTCCGT ATTGGCGGCG ACGGGACAGG ACAAAGCTGG
GGGCGCTATC GTTTCGTCAA CAATACCCTA GTTGGTCGCA GTGATGGAGC CGTCGCCATC
CGGGCCTTTG ACGGGCTGCA AAGTATCGAA CTGCACAACA ATGTCTTCAC CAACGCCAAC
GGCACGGGAA TGCGTATCAT TAGAGACACC GAGGCCACAT GGCACAATGG CTTGCGGGTA
GTTGCTGGGA TCAATAACTG GATTCAAGCA GGCTCGGTCA GCGCCCCAGA ACTCATCGGC
ACTATTCAGG GCACCGATCC TCAGTTCGTA AATCTGGCGA CTGGCGATGT TCGCCCGTCT
ACGAATAGTC CGCTGATCAA CGTTGGGACA TCCAACCCCG CCAGCCCAAC TGGCTACCCG
TTCCCCTCGC CGCTCATGCT GCCGAAGCAA CATCCGCCGC TCCGGACTAT TGCGCCGGTA
ACAGTGGTCG ATGCGCGGCC AGTCGTCGGT GCAATCGATG TTGGTGCCTA CGAGATTGGG
ATACCGCCAC TCTACACCAA TCGCGTCTAT ATCCCGATCA TAAAACGCTA A
 
Protein sequence
MAATWKYGRI LGHGVIGRLF GMILLRISER EASIILHRIQ AALYSSRPFY SIHHRYNAAT 
DWLFAYFFPD NLMGDLLIIF LNRRRPMAFI TSSDLCLVLI HRSLRGRLLV WQAAIMVAMA
LLPGNSLAAS PNQSYEVGPG RTYARLSDLV SADVLGPGDT VLVYPNGTAS YNDTVIFDTH
GTADRRITIR GVRVNGQRPI LSSNNNYGIV FKGDHYIFEG FEVTGAVGNQ YVIIHRADHI
LIRDTLIRDC PGTGLLGHDE DAGSLTLDHV EVTNCGNGLY QHPIYMTTGL PGAVFRMQYC
YLHNQKGGNG VKSRANRNEI YYNWIEGSYY HELEMIGPDD GTGGSPDSPR HSDVVGNVFI
KKQDFATLVR IGGDGTGQSW GRYRFVNNTL VGRSDGAVAI RAFDGLQSIE LHNNVFTNAN
GTGMRIIRDT EATWHNGLRV VAGINNWIQA GSVSAPELIG TIQGTDPQFV NLATGDVRPS
TNSPLINVGT SNPASPTGYP FPSPLMLPKQ HPPLRTIAPV TVVDARPVVG AIDVGAYEIG
IPPLYTNRVY IPIIKR
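
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below is a minimal, standard-library illustration rather than the database's own method: `clean_sequence` and `translate` are hypothetical helper names, and the codon table is built from the standard compact encoding of the genetic code. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; since this gene starts with ATG, that difference does not matter here.

```python
from itertools import product

# Compact codon table: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCTGCTA CCTGGAAATA CGGTCGTATC CTTGGGCATG GAGTCATTGG CAGGCTGTTC"
print(translate(first_line))  # prints "MAATWKYGRILGHGVIGRLF"
```

The output corresponds to the first 20 residues of the protein sequence listed in the record.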