Gene Haur_0938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0938 
Symbol 
ID5732824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1073141 
End bp1074901 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content51% 
IMG OID641278070 
Producthypothetical protein 
Protein accessionYP_001543714 
Protein GI159897467 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000915423 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGGC AGCACTTTCG CTATCAACGC CTCGCACTAC TCAGCCTTGT GGCTTTGCTG 
GGAGTGTTAG TCACACCAAC CAAAGCCGTC GAAATCCCGC CAACAGGCGC ATCGGTGTAC
CTCCAAACCA ATGGGCCTAC CAATACCTTG AATAACGGTG ATTGGTACAC CAATAGTCTG
GCAGGTGCAG GTAATGGTTA TCATTACTTT ACGGTTGATA TTCCATGTGC TTGGCCGAGC
ACTGAGCCAG TCCATATCGA TATCTTTAGC CCTGAAATGA ATAGTAATGC CCCCCTCAGC
GACGAGATTC GTGGTGGGGT GTACGATAAT ACGCAATTTG AGTTCTATGC TGCTGGCACG
CCGATTGTTG TTCCAGCAAC GCCTGGCCCA GGTGCGGCTG GTAGTTTGAT TCAACAAACC
TTTGTGCCTG CTGGCACGCC TGAGGCATGG TTGCGCTTCT ACACAATTGC CGCACCTGTC
ACTTGTGGTA CGTATGTTTT GCGTTCAGCC ACTTCCGGCA ACGACGAAAA TGGTTGGCGC
TTGCGGGTTG GCCGCGATAA CGACGCTGAT CCCAATAACG CGCCACCAGC CAACACCGAT
AATTTCGATG GTGTAGCTGG TACTGGCGAT GAAATTACCC TTGGGATGCG TCAGGCTTCG
TTCCAACACG ACGCTGGTGC AGCAGATGTT GTCGCCACAT GTTTAACCTT GTATGAATAT
GTTACTCCTG GTCAGCCCAG CGTTAGCTTT AATAATTTTG ATATCGATAA TGTCCGTCGG
GTGCGTTATT ATGCGCCTGG TGATGCGAGC TACACGCCTA TGGGCAATAG CGGTGGCATC
GTGGGCAGCC TTAGCAACGA CCAAATTTGG AATGGCACTG GCGCAACCTT AGCCACCCGC
GTCGGCGATA CGATCAATAA CCCAGTTTCA GGCTGGTGGC GGATCGTAAC CTGTACCAGC
AATCACAATC AGTTTATTCA AGAAGGCCAA ACTGGCACAC CAGCCTACTA CGAACAGCCA
CCAACGCCAG TCATGGCCTT GAGCAAAACC GATGGCGTTA CCTTGGTCTT ACCAGGCGAT
ACGCTGAATT ACACCATCGC CTTTACCAAT ACTTCCAACA GCACGGCCAC GCCAGGTAGT
GCAACCAACG TTACCTTAAC CGACAATTTA CCACCCGACA CGACCTTTGT CAGTTGTGCG
ATTAATCTGC CATTCACTGG TACATGTAAT CATGCTGCTG GTGTGGTGAC CTTCAATATT
ACCCAAATAG TTCGCCCAGG CGAAGTTGGC ACACTCAACG TTCAGGTAAC CGTCAACGAT
CCGATCACCA CGGTTCCGGT GGTCAACAAT GTTACCTTGA CCTTCAACGA TACCTTGAAC
AATGTGTTCC AACCATTGAA CGCCAGCGAT AGCGATTTGG TCAATCCAAC TGCGGTAACT
GTGGTTGGCT TCACGGCCTT GGTACGGGTC GATGATATTC AAGTGCGCTG GAGCACTAGC
CAAGAATTGG AAACCCAAGG CTTCCATATC TATCGGAGCA CCAGTGATGA CCCAGCGACC
GCCGTCCAAG TGACTGAGAA CTTGATTCCA GCCTTGGGTG CGCAAACCAA CTATCAATGG
CTTGACACCA ACGCTGAGCC AAATGTGCAT TATTACTATT GGTTGGTTGA AGTCGATGCC
AACAATAATT TGAGCATGAT CGGCCCAACC GATGCACAAA TCGAGCGCTA CAGCATTTTC
ACTCCCTTCG TTATACGCTA A
 
Protein sequence
MGRQHFRYQR LALLSLVALL GVLVTPTKAV EIPPTGASVY LQTNGPTNTL NNGDWYTNSL 
AGAGNGYHYF TVDIPCAWPS TEPVHIDIFS PEMNSNAPLS DEIRGGVYDN TQFEFYAAGT
PIVVPATPGP GAAGSLIQQT FVPAGTPEAW LRFYTIAAPV TCGTYVLRSA TSGNDENGWR
LRVGRDNDAD PNNAPPANTD NFDGVAGTGD EITLGMRQAS FQHDAGAADV VATCLTLYEY
VTPGQPSVSF NNFDIDNVRR VRYYAPGDAS YTPMGNSGGI VGSLSNDQIW NGTGATLATR
VGDTINNPVS GWWRIVTCTS NHNQFIQEGQ TGTPAYYEQP PTPVMALSKT DGVTLVLPGD
TLNYTIAFTN TSNSTATPGS ATNVTLTDNL PPDTTFVSCA INLPFTGTCN HAAGVVTFNI
TQIVRPGEVG TLNVQVTVND PITTVPVVNN VTLTFNDTLN NVFQPLNASD SDLVNPTAVT
VVGFTALVRV DDIQVRWSTS QELETQGFHI YRSTSDDPAT AVQVTENLIP ALGAQTNYQW
LDTNAEPNVH YYYWLVEVDA NNNLSMIGPT DAQIERYSIF TPFVIR