Gene Haur_1752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1752 
Symbol 
ID5733639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2038634 
End bp2040031 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content51% 
IMG OID641278894 
Producthypothetical protein 
Protein accessionYP_001544523 
Protein GI159898276 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000233386 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGTA TGATCGTTTT TGCTATTGTT GTCTTGTTGA TGGCCGCGTG TAGCTCGCAA 
TGGCCTAGCG CCCCAATTGC CAAAAGTACT CCTCATAATC CAGCCAATTT TCCTCCCAGC
CCGCAAAACT CACCTACCAT TGGTCAATGC CCGGTTTTTC CACTTGATAA CATTTGGAAT
ACCCCAATCG ACACCTTGCC TGTCCACCCT CGTTCGAGCC AATATATTGC CAGCATTGGC
GGCAGCGAAA CCTTGCACCC TGATTTTGGC GCGGCCCAAT GGAATGGTGG TGATATTGGG
ATTCCCTATG TGGTTGTGCC CGCTAACCAA CCAACTGTGA CCGTTAATTT TGTCGATTAT
CCGCATGAGA GTGACCCCAC CACTGGCTCA GGCCAATACC CAGTGCCACC GAATGCGCCA
CGTGAGCATG GTAGCGACCA TCATGTGTTG GTGGTGCGTG AAGGTGAATG TAAGCTGTAT
GAGCTGTACA ATGCCACCAA AATTAACGAT ACAACTTGGA ATGCCAGTAA CGGAGCAATC
TTTGATTTAC GCTCGAATAG CTTACGACCT GATACCTGGA CTTCCGCTGA TGCAGCAGGT
TTGCCAATCT TGCCTGGCTT AGTGCGCTAC GAAGAGGTGC AAGCAGGCGA GATCAACCAT
GCAATTCGCT TTACGATTCA GCGTTCACAA CGGGCCTATG TCTGGCCAGC GCGGCATTTT
GCGTCATCAA TTACCGACCA AAATGTGCCA CCAATGGGTA TGCGCTTTCG GCTCAAGGCA
TCGTTTGATA TTTCGGGGTT TTCCAGCGAG ATGCAAGTTA TTTTGCGGGC AATGCAGCGC
TATGGCATCA TTGTGGCCGA TAATGGTTCA GATTGGTATA TTTCTGGCGC ACCGAATCCC
AATTGGGATG ACGATAATTT GGTGAGTAGT TTCGACCAAA TTCGCGGTGA TCATTTTGAA
GCTATGGATA GCTCGAGCTT GCAACTCAAC CCTGATTCGG CAGCGGTTAT CGCTAGTGCT
GCCCCGCAAC CAAGCAAATT GGCCGAATTT GGGGGCGTTG ATCAAGGCCA ACAATTGCGC
TATGCGATTA CAGTGGTTGG CACAGGCAGC CCACAAACCA TGAATGATCA ACTGCCTGAT
GGCCTAACAA TTGTTCCGGC GAGCGCCACG ATTAACCCAA GCAATTTGGC GGCCCCAGCC
ATTAGCAACA ATAGTGTTCA GTGGAGCGGC ACAATTCCCA ATTCGCAGAG TGCGGTGATT
AGCTTTCGTG CAACGGTCAG CACCAACGAG CGCCGCGTTA TTATCAATAC TGCCCAGATT
AATGCTGCCA CAGTGCAGGC CAGCATTATT GCCAATGGCT ATCGTGTTTG GTCGCCCATG
GTGCGCAAAC TGAAGTAG
 
Protein sequence
MRRMIVFAIV VLLMAACSSQ WPSAPIAKST PHNPANFPPS PQNSPTIGQC PVFPLDNIWN 
TPIDTLPVHP RSSQYIASIG GSETLHPDFG AAQWNGGDIG IPYVVVPANQ PTVTVNFVDY
PHESDPTTGS GQYPVPPNAP REHGSDHHVL VVREGECKLY ELYNATKIND TTWNASNGAI
FDLRSNSLRP DTWTSADAAG LPILPGLVRY EEVQAGEINH AIRFTIQRSQ RAYVWPARHF
ASSITDQNVP PMGMRFRLKA SFDISGFSSE MQVILRAMQR YGIIVADNGS DWYISGAPNP
NWDDDNLVSS FDQIRGDHFE AMDSSSLQLN PDSAAVIASA APQPSKLAEF GGVDQGQQLR
YAITVVGTGS PQTMNDQLPD GLTIVPASAT INPSNLAAPA ISNNSVQWSG TIPNSQSAVI
SFRATVSTNE RRVIINTAQI NAATVQASII ANGYRVWSPM VRKLK