Gene Haur_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0109 
Symbol 
ID5732002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp141655 
End bp143475 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content37% 
IMG OID641277231 
Producthypothetical protein 
Protein accessionYP_001542889 
Protein GI159896642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTT ATGCAAAGGC TTGTATGGGC ACATTTAAAT TAACATCAAT CAAAGATCGG 
TATATTATTG TATTTGGAAC ATTTTGTATA TTTCTATGCT ATGCACTATT TATCTTCAAT
TCTGATGTGG CTTATTTAGC CTTTACTCAA CCTCAAGATG ATTCGTTTTA TTATTTCCTA
CCGGCCTGGA ATTTCAAAGC CTATGGATTT TATACCTTTG ACGGTTTGAA TGAAACATAC
GGCTTTCAGC CATTATGGAT GGTTTTACTA ACCATAATGG CTTTTTTTAT ATCAAGTAAA
ATTCTATTTT TCAAGCTTTC GCTTCTAGTG GGATGCCTGT TCTATCTCTT AACTGGCTTA
ATTATCTACA AAATCAGCAA GTTATTGATC AGGCAACGGC TACTAACATT TATTCCATGG
TTTGTTTGGG TTAGCAATAT CTACTTATTA CGTATATTTG CTTCAGGTAA AGAAAATTCA
CTCTCAGTTT TTTTATATGC ACTGATTGCT TTAAGCCTTC TCAATATCCA TTTTAAAAAT
CGCCAACAAT CAAGCCTGTA TGGGTTAATC GATGCTAGCT TTATGCTGGT TCGGATCAAC
AATCTGATGT TTGTTGGGCC AATTCTGCTC TATCGGCTCT ATCATAATCG TCAGCAAAAA
TCCCAGATTG GATTCTATTT AGCAACCTTT AGTTTAGTAT TAAGTCTATG GTTTGGCTAT
AGCTATGCTG CATTTGGTAC GCTTTTTCCC AATAGTGGCA GCTTAAAGAC AGTTATTTCT
AAACCTTCAA TCGTCTATTG GCTTAATCAA CAAACTGGCG TTGAGCTGGG TGGCATGCTT
AGCTCACAGG AACAATTACT CTTACAGCAT CCTGAATTTT TAGATGTTCC GCGGGCGAAT
TTCTTCTGGC AATATCTTAG TCAAATTGTG CCCCAAAAAG TTACAGATAT TTATTTTGAT
CAAAAATTCT CGTCAATAAC GCTTGTTTCA ACGCTCAATC AAGTTCTTTT GCTGAGTATT
GGGCTTGGAT TAATTGGATT TTTAGGTGGA TTATGGCAAC GAAGCATTAC CATCAATCAA
CAACTAATCA AATTATTAAG CTATTGTGGC ATCATTGCCT TAGCAAATAG CGTGGTTAAT
TGGCTATTTT TTCAACGTTA CCTCAGCTTT ACGATTTGGT ATCCAGTACC AGAATTATTT
TGGTTTAGCT TGGTATTGGG TTTATTAGTG GTAGCAAGTA TCATTGGTTG GCAACAATTA
AGCCAAATCA AACCCATCAT TAAGCCAGTG CAATACATAA TGTTGCTAGC AATTGGCATA
TTTTTATTAA GCCCATTGAG TCGTTTTCCA CAGGAGCTAT TACCGCAAAA AACCAGTGAA
CATTATCGTG GAACCTATCG ATTTTTCTCG TATATTTGGC AGGATGAAGC ACTCAAAGCC
ACTAGTTGGG CCAACCAAGC ATTAGCGCCA AATACGACTA TTGGTTCGTG GAATGCAGGG
ATTGTTGGCT ATTTTTATGA AAATGGTTCG ACGATTAATT TAGATGGCTT AGCAAACAGC
CCAGCGTTTG TCGATGAGGT TTTACGCCAG AATATTTTAT TTACACGTGG CTTAGCAAAC
GAAAATGTGC TATGGAACTA TATTCAACAC CAGGATATTC GCTATATTAT TGATTCATGG
TATAGCGGGG AAATGGGCAA AAGCAGATTT ATTAATAGTA TTCCACCAGA GCATTATGAA
ATTATCTACG AGGGTGCGGT TACGTTTTCA GATGGGAATC GGCCTGATCG AAGAATGTAT
GTATTGAAAT TGAAGTATTA A
 
Protein sequence
MDFYAKACMG TFKLTSIKDR YIIVFGTFCI FLCYALFIFN SDVAYLAFTQ PQDDSFYYFL 
PAWNFKAYGF YTFDGLNETY GFQPLWMVLL TIMAFFISSK ILFFKLSLLV GCLFYLLTGL
IIYKISKLLI RQRLLTFIPW FVWVSNIYLL RIFASGKENS LSVFLYALIA LSLLNIHFKN
RQQSSLYGLI DASFMLVRIN NLMFVGPILL YRLYHNRQQK SQIGFYLATF SLVLSLWFGY
SYAAFGTLFP NSGSLKTVIS KPSIVYWLNQ QTGVELGGML SSQEQLLLQH PEFLDVPRAN
FFWQYLSQIV PQKVTDIYFD QKFSSITLVS TLNQVLLLSI GLGLIGFLGG LWQRSITINQ
QLIKLLSYCG IIALANSVVN WLFFQRYLSF TIWYPVPELF WFSLVLGLLV VASIIGWQQL
SQIKPIIKPV QYIMLLAIGI FLLSPLSRFP QELLPQKTSE HYRGTYRFFS YIWQDEALKA
TSWANQALAP NTTIGSWNAG IVGYFYENGS TINLDGLANS PAFVDEVLRQ NILFTRGLAN
ENVLWNYIQH QDIRYIIDSW YSGEMGKSRF INSIPPEHYE IIYEGAVTFS DGNRPDRRMY
VLKLKY