Gene Haur_0564 details


Gene Information

Locus tag: Haur_0564
Symbol:
ID: 5732285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 653980
End bp: 655413
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 56%
IMG OID: 641277691
Product: hypothetical protein
Protein accession: YP_001543340
Protein GI: 159897093
COG category:
COG ID:
TIGRFAM ID:
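
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 653980
END_BP = 655413


def gene_length(start_bp, end_bp):
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the reported gene length of 1434 bp and protein length of 477 aa.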


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAAAG CTGTCGAATT GGCAAAGTGC CATGCCGACC CTGCCTACTT CACCCACAAC 
TACGGACGAA TAGACGACGC GCAGGGTCTT GGTGATGGCA GCGGCGATAT GCCGTTTACG
CTCTGGCCTG CGCAGATTGA AGTCCTGTGG ACGCTTCTGC TTCAGCGTCT AATTCTGATC
TTGAAGGCGC GGCAGTTGGG TATTAGCTGG TTGTGTTGTG CCTATGCCCT GTGGCTCTGC
CTGTTCCAAC CGGGCAAGGT GGTGCTGATC TTCAGTAAAG GTCAAGGCGA AGCGGACGAG
ATGCTTCATC GGGTCAAACG GTTGTATGAA CGCTTACCCG ATTGGATGCG CGAAGCCTCG
CCAGCGCTGG TGACGGACAA CACGACCGAA CTGGAATGGG CGAATGGCAG TCGGGTTAAA
TCACTGCCCG CGACCAAAGG GGCAGGGCGT TCGTTCACTG CATCGCTCGT GATTTTGGAC
GAAGCCGGAT TCTTGATTTG GGCTAAGCAG TTGTATACCG CGCTCAAGCC CACGATTGAC
GGCGGCGGCC AACTGATTGT TCTCTCCACA GCCAACGGGA TTGGCAATCT GTTTCATCAA
TTATGGGTCA AGGCACTCAG TGCCAAGAAT CGGTTCAAAA CTATTTTTCT GCCATGGTGG
GCGCGACCAA CCCGTGATGC CCAGTGGTAT CAAGAGCAGC TTGAGGAGTA TACCGACCTT
GATATGGTTC GGCAGGAATA TCCCTCAACC GCGCAAGAAG CCTTTTTGGT GTCAGGGCGC
ACACGGTTCA AAATGCCGTG GTTGCTTCAG CAGACACCGA GTGACGGTTT AGCAGCGGAA
TCGTTACCCG ATGCGCTGGA CAAGCTTGAC GGCGTAACGA TGTATCAATT GCCGCAACAA
GGACGGCGCT ACATTCTGGC AGCGGACGTA GCCGAAGGGC TGGAGCACGG CGACTTCTGC
GCTGCCACCC TAATCGACGC AGTGTCATGG GAGGAAATGG TCAGCGTCCA CGGCAAGTGG
GAACCGGATG AATACGCTCG CATCTTGATG GCGTTGTCGG ATGGGTACGG GGCCACGGTT
GCGGTCGAAC GGAACAATCA CGGCCACGCG GTACTGACGA CGATGAAGCT GGCCGGATTC
ACGCGCATCG TGTATGGCCT TGATGGGCGA GCAGGCTGGC TGACCAACGC CCAGACCAAG
CCACAAATGA TTGACCTGTT AGCAACGGCA CTGCGCGATG TGTTGGTAAA AATTCGCAAT
CAAACGGCGC TGAATGAACT AGCGATTTAT CGGATTTTGA AGAACGGCGG GACAGGCGCA
CCCGCAGGCT ATCACGATGA TTTCGTGATG GCATGGGCCA TCGCTCTCAT GGTTGCCAGT
CAACCAACGG AAGTCGAAGA CGAAGCCATT GCTGGCTCGT GGGATAGCTA CTAA
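
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the hypothetical helper below strips the whitespace and computes the GC percentage of whatever sequence text is passed in. How the source database rounds its reported GC content is not stated, so this is not guaranteed to reproduce that figure exactly.

```python
def gc_percent(sequence_text):
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Usage: paste the block-formatted sequence as a (multi-line) string.
first_line = "ATGTCGAAAG CTGTCGAATT GGCAAAGTGC CATGCCGACC CTGCCTACTT CACCCACAAC"
print(round(gc_percent(first_line), 1))
```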
 
Protein sequence
MSKAVELAKC HADPAYFTHN YGRIDDAQGL GDGSGDMPFT LWPAQIEVLW TLLLQRLILI 
LKARQLGISW LCCAYALWLC LFQPGKVVLI FSKGQGEADE MLHRVKRLYE RLPDWMREAS
PALVTDNTTE LEWANGSRVK SLPATKGAGR SFTASLVILD EAGFLIWAKQ LYTALKPTID
GGGQLIVLST ANGIGNLFHQ LWVKALSAKN RFKTIFLPWW ARPTRDAQWY QEQLEEYTDL
DMVRQEYPST AQEAFLVSGR TRFKMPWLLQ QTPSDGLAAE SLPDALDKLD GVTMYQLPQQ
GRRYILAADV AEGLEHGDFC AATLIDAVSW EEMVSVHGKW EPDEYARILM ALSDGYGATV
AVERNNHGHA VLTTMKLAGF TRIVYGLDGR AGWLTNAQTK PQMIDLLATA LRDVLVKIRN
QTALNELAIY RILKNGGTGA PAGYHDDFVM AWAIALMVAS QPTEVEDEAI AGSWDSY
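
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used by the source database. It builds the codon table from the standard NCBI layout; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which this sketch does not model. The function and constant names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence_text):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(sequence_text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate("ATGTCGAAAG CTGTCGAATT GGCAAAGTGC"))  # MSKAVELAKC
```

The last line of the gene sequence begins in frame (1380 bases precede it), so it can be translated on its own in the same way and compared with the end of the protein sequence.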