Gene Haur_4606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4606 
Symbol 
ID5736453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5891082 
End bp5892836 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content52% 
IMG OID641281770 
Producthypothetical protein 
Protein accessionYP_001547365 
Protein GI159901118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000237555 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCAAAT TTGGTTTATT ACTGGTGATG CTCTTTGGGC TTGTTACTCC TGCTCCGGCT 
GCTCCCGATG CGCCGTCTGC TCAGCCCAGC ACAATCACGC CTTGGGGCAT CAATGGCTAC
CTCACCAAAA ATGAGCGGGT GGCTGGTGGC GATAATGTGG CGCAATTAGC CCAAACCATG
GCCTCGGCCA ACGCCGATTG GTCGCTCGAA GAGTTGCCAT GGGCCGAAAT CGAGCCAAAC
AATGGCACAT TTCGCACCAC CTACGATAGC CGAATCAAAA CCGTCGCCGA TGCCAACCTT
GGGATTATTG GCATGCTGCT GACGACCCCA GCGTGGGCAC GCGAATCTTC GTGTGCTGGC
AATAACAACT ACTGGTGTCC GCCGAGCGAT CCGGCCCAAT ACGCCGAGTT TGCCGCTTGG
ATGGTCGAGC GCTACGATGG CGATGGAGTC AGTGATGCCC CAGGCTCGCC GCGCATTGCT
GCTTGGCAAA TTTGGAACGA GCCAAACTTT GTTGCTACCT GGAGTTCAAT CAACAACAAC
GAGGCTTTGC GCCGCCGCCG TTATGGCGAA ATTTTGGTCG CCGCCTATAC CGCGATCAAA
CAAGCTGATC CGACGGCGAT TGTGGTAGCT GGCGGGGTGT ATGTGTTCGA TGGCTTTAGC
GATGGCTTCG ATTTTCTCAA TGGCACAAAT GGGGTGTTTC GCCAAATACC TGAGGCCAAA
ACCAGCTTTG ATGTGCTGGG AATTCACCCC TACATGCCGA CGATTGCCCC CGATGCGATC
GGCACATTTT CAAGCGTCAC GCTCGAAGGG CGTTTACTCA ACACCCGTAA TTGGTTGACT
AACGATATTG GCCGTCCTAG CGCCCCAATA TGGATCACCG AGATTGGTTG GTGTACCAGC
CCTGGCTCGA CCAGTTGCCC CGTGGTTAGC GCCGAAAATC AAGCTCGCTA CCTAATTCGT
AGCTTTGTGA TTGCCCAACA ATTAGGTGTG CAGCATATTA ATTGGTTGCA ACTTGAAGAT
GCTTTCAATG GTTCGCACCC ATTTAGTGGC TCGGAATTGC TTAATGATTT AAATACGCCT
GAGCCAGAAA AACAAGCTTA CACCGCTTTT CAAACCATGG CTGGTTTATT GAATGAGGCC
ACCCCGCTTG GCATTCGAAC TGGCGTGCAT ACCCACAATT ATGTTGCCAA TAGCAATAAC
ACTGGCGGCG TGTATGCCTA TCGGTATAGC CGTGGCAACA CCGAAATTGA TGTGCTTTGG
ACTCCCGGGG CTAATACCAC CATTCAGTTC CCGCTGACTG CTGGCAAGCA ATGGATTTTC
CGCACCCGCA ATAATCAATC GTTTAGCCCA ACGATTAATG GCACAACCGC TAGTATTGTG
CTGACCAACG ACCCAATTTT TATTGTGCAG AAAATTCCGG TCAGCTTGAA TGTGCAAAAT
TCGCTGAGTT TATTGGCCGA AGTTAGCAGT GGCGAGGCAC GGGCCGATAT TCCGTTGAGC
AATGGCGGCA GCGAAAATGC CCTCGTTTGG AATATTAGCA ATGCTTCGGC GGGTTTAACC
GTTACGCCGA GCAGTGGTAG CGTGGTTGGC ACCGCCAATT TGACGATCAA GGCACAAGTT
GGTAGCCTAA GTGCCGGAAC CTATAGTTAT CAGTTTACGG TTAATGGTGA TGGTGTTGCT
TCACGCACCG TTCAAGTAAG CCTGCGGGTG GTTGATCAGC TTCAGCCCAT CTATTTGCCG
CTGACTAGAA AATAG
 
Protein sequence
MRKFGLLLVM LFGLVTPAPA APDAPSAQPS TITPWGINGY LTKNERVAGG DNVAQLAQTM 
ASANADWSLE ELPWAEIEPN NGTFRTTYDS RIKTVADANL GIIGMLLTTP AWARESSCAG
NNNYWCPPSD PAQYAEFAAW MVERYDGDGV SDAPGSPRIA AWQIWNEPNF VATWSSINNN
EALRRRRYGE ILVAAYTAIK QADPTAIVVA GGVYVFDGFS DGFDFLNGTN GVFRQIPEAK
TSFDVLGIHP YMPTIAPDAI GTFSSVTLEG RLLNTRNWLT NDIGRPSAPI WITEIGWCTS
PGSTSCPVVS AENQARYLIR SFVIAQQLGV QHINWLQLED AFNGSHPFSG SELLNDLNTP
EPEKQAYTAF QTMAGLLNEA TPLGIRTGVH THNYVANSNN TGGVYAYRYS RGNTEIDVLW
TPGANTTIQF PLTAGKQWIF RTRNNQSFSP TINGTTASIV LTNDPIFIVQ KIPVSLNVQN
SLSLLAEVSS GEARADIPLS NGGSENALVW NISNASAGLT VTPSSGSVVG TANLTIKAQV
GSLSAGTYSY QFTVNGDGVA SRTVQVSLRV VDQLQPIYLP LTRK