Gene Haur_3352 details


Gene Information

Locus tag: Haur_3352
Symbol:
ID: 5735222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4228127
End bp: 4230175
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 49%
IMG OID: 641280499
Product: hypothetical protein
Protein accession: YP_001546116
Protein GI: 159899869
COG category:
COG ID:
TIGRFAM ID:
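
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS includes its terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


length = gene_length_bp(4228127, 4230175)
print(length)                     # 2049
print(protein_length_aa(length))  # 682
```

Both results agree with the Gene Length and Protein Length fields of this record.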


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0148489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATCG ATTCCCAGCA TTTAACCCAA GCCAATCAAT CAACTTGGCA ACGCTATCAA 
ACGGTTATCG CCAGCCTTAG TGCAGGGGTA GCCATCCTGC TTGCATTGTT CAGCTTTGGC
CTTGCTGCAA GCTTCGACTG GCTCGTAGCT AATGCTCAAC CAATTATTGG CTTTGGTAGC
TTTGTGCTTT TGGTGATTGG GCTACCTGGC TGGGCTACAA TTCGCTGGCT TACCCCGCAG
CATGTGCTCA CTCGCAGCGA GCGTTGGGCT TTGAGCTGGG CCGTGGGCAT TGCCTTACCA
CCAATTTTAT TCAATCTCTT TCACATCGTA GGCCTATCAA TTAATCGTTG GGTTGTGGTT
GGCTATGCGT TGCTTGGGCT ATTGTTGACA ATCTGGCCTG AGCCACGGTC AGCTTGGAAT
ACCCGCCTTG CCCAACTCAA ACAAATCCGC ATCAGCAGCC ATGCCTGGAT CTTGCTGAGC
ATCACGCTAG TCAGTATCCT TCAGCGTTTG TTGGCTGTAC GCGAATTGAG CGTAGCCCAG
TGGAACGATG CCTATCACCA TACGATCATC ACCCAACTTT TTTTAGATCA TGGTGGTATT
TTTGAGACAT GGCAGCCCTA TGCTGATTTA AACACCTTCA GCTACCACTA TGGGTTTCAT
GCCAATAGCG CCTTTTTGGC ATGGTGGAGC CAACTGCCCG CCACCACTAG CGTGCTCTAC
ACAGGTCAAC TGCTGGGCAT TGCCACCGGC GTGATGGCCT ATTTACTGGG CCGTCGCTTG
AGCAATCGGC CAAGTGTGGG CTTAATCGCC TTTGGCTTGA CCAGCTTCTA CAACCTCATG
CCCGCATATT ATGTCAATTG GAGCCGTTTT ACCCAATTGA TTGGCCAAGT TATTTTAGTA
GGCTTGGTGG TCATTTGGCT GCTAGTGTTG GAATATCCAC AGTTTAGTTG GAAATTGGTT
GGTCTCGCCA GCGTGCTAAC GACAAGTTTG CTGCTGACTC ATTATTTAGT CACGATATTT
GCGGTGGTTA TGGTTGGTTT CGGCATTTTA GCCTTATTAG CCCGCCAGCC AAGCTTAACC
AATTTGAAGC AAATCAGCTT ACGTGCAACA GCAATTAGCC TTGCAAGCGC TCTGATCGCA
GCACCATGGC TGTATACCAT TATTCAAAGT AAACTTACGG CGATTGCCCG TAACTATGTG
ACTGGTTATA GCGTTGGCTA TGCCACAACG GTAGCAACCC TCGACCAAAT TGTGCCAACC
TATATTAAAG CTCCAATTAT GCTGTTGGCC GTTGTCGGGA TTTGGTTGGC TTGTGCCCAA
CGGGCTTGGC GTATGCTGTT ACTGGTGGTT TGGAGCCTTG GGCTTCAGAT TTTGGCAGTG
CCCTATGTCT TTAACTTACC AATTAGCGGC ATTATTAGCG GATTCGCCGT TTCAATTATG
CTCTATCTGA CGCTGATTCC ATTAGCAGCC TATCCCTTGG GCCTTGTGCT GGAGCGCTTT
AACCAGCAAT GGTATGTCAA AGGCTTGGCT CTGATCGGCT TATATGGGCT GATTGTGTGG
TCTACACCAT GGCAAACTGC GATTGTCAAC GAGCAAAATC GTTTATTGAC ACGGGCTGAT
GAGCAAGCGA TGCACTGGAT TCGCACTACA ACTGAGTCAG AAGCTCGCTT TCTGATTAAT
GGACTTTTTA GCTATGGCGA TGCATTAATT ATTGCCGATG ATGGTGGCAT GTGGATTCCC
TTCCTGACTG GTCGCCAAAC TACCATTCCA CCACTGACCT ATGGCTCGGA AAAAGCGATT
AATCCGCAAC TTGATCGCGA AGTCTACGCC TTATACGATG CATTACGCAC CACCAACCTA
GAGACTGCCG CAGGCCTTGC CTTGCTGCAA CAGCATCAGG TTGATTATAT TTATACTGGG
CCGCATATGG GCAAAAATGC TCAAAAAATT CAACTCAACA CCCAAGCACT CCGCTATCGT
CCTGAACAAT TTCCAATTGT CTACGAGCGC GATGGAGTGG TGATTTTTGC AGTAAAGGCG
CAACAATGA
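
The GC content field in the gene information can be approximated from a sequence like the one above as the percentage of G and C bases. This is a minimal sketch: `gc_percent` is a hypothetical helper, and how the source database rounds the value is an assumption not stated on this page.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence, pasted as a single string, should give a value close to the GC content reported for this gene.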
 
Protein sequence
MQIDSQHLTQ ANQSTWQRYQ TVIASLSAGV AILLALFSFG LAASFDWLVA NAQPIIGFGS 
FVLLVIGLPG WATIRWLTPQ HVLTRSERWA LSWAVGIALP PILFNLFHIV GLSINRWVVV
GYALLGLLLT IWPEPRSAWN TRLAQLKQIR ISSHAWILLS ITLVSILQRL LAVRELSVAQ
WNDAYHHTII TQLFLDHGGI FETWQPYADL NTFSYHYGFH ANSAFLAWWS QLPATTSVLY
TGQLLGIATG VMAYLLGRRL SNRPSVGLIA FGLTSFYNLM PAYYVNWSRF TQLIGQVILV
GLVVIWLLVL EYPQFSWKLV GLASVLTTSL LLTHYLVTIF AVVMVGFGIL ALLARQPSLT
NLKQISLRAT AISLASALIA APWLYTIIQS KLTAIARNYV TGYSVGYATT VATLDQIVPT
YIKAPIMLLA VVGIWLACAQ RAWRMLLLVV WSLGLQILAV PYVFNLPISG IISGFAVSIM
LYLTLIPLAA YPLGLVLERF NQQWYVKGLA LIGLYGLIVW STPWQTAIVN EQNRLLTRAD
EQAMHWIRTT TESEARFLIN GLFSYGDALI IADDGGMWIP FLTGRQTTIP PLTYGSEKAI
NPQLDREVYA LYDALRTTNL ETAAGLALLQ QHQVDYIYTG PHMGKNAQKI QLNTQALRYR
PEQFPIVYER DGVVIFAVKA QQ
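
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below uses only the standard library and builds the amino acid assignments of NCBI translation table 11, which match the standard code (table 11 differs only in which codons may act as start codons). The helper name `translate` is illustrative. Unknown codons are rendered as `X`, and translation stops at the first stop codon; both are choices made for this example.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (NCBI table 11 layout).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display grouping.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = ("ATGCAGATCG ATTCCCAGCA TTTAACCCAA "
              "GCCAATCAAT CAACTTGGCA ACGCTATCAA")
print(translate(first_line))   # MQIDSQHLTQANQSTWQRYQ
print(translate("CAACAATGA"))  # QQ
```

The two calls translate the first and last lines of the gene sequence, and the results match the first 20 and last 2 residues of the protein sequence.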