Gene Haur_2586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2586 
Symbol 
ID5734464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3319658 
End bp3320932 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content52% 
IMG OID641279726 
Producthypothetical protein 
Protein accessionYP_001545352 
Protein GI159899105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000365583 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGG ATGTTGTTCA GATACAGTAT GAGCAAACTG CCCAAGTTGC TCAGCATTTT 
GGCCGCCTCA AGCAAACGGT TGAACAATTG CAAACGACAA TTCGCAACGT CAGCCAATCA
CTGGTTGATG GCGATTGGCA GGGCGATGCC AGCGTTGCCT TTGCGAAAGA ACTCGATGGT
GAAATTTATC CAACCTTCAA TCGTTTGATC ACTGCATTCC AAACCGCCCA AGAAGTCACG
CTCGAGGTCC AAAAAATCTT TGGTCAAGCC GAAGAAGAGG CCGCTGCCTT ATTCAAAGGC
GAGTTATTCG GCAACGCTGA TGGCGGCAGC AAGGGTGGCG GAAGTTGGCT CGATAGCGTT
GGTGGCTTCT TCAGCAGTGT TGGCAATGGC ATCAAAGATT TCTTCGTTGG CGCAGGCAAA
GAACTCAAGG ATATGGTGGT TGGGGTTTGG AACATGGTCA CCAGCCCAAT CGAAACCGCC
AAAGGCATTT GGCATGCGGT AACCCACCCA GGCGAATTCT GGGAAGCCTT CAAAGCGCCG
TATGTTGAAG CCTGGGAAAA TGGCCGCCCA TGGGAAGCAA TTGGCCGTGG CACGATGTTT
ATTGGCTCGT TGCTAATTGG CACTAAAGGT GCTGATAAAG TTGGTAAAGC GGCTAAGATC
AGCAAAGCAG CAACCGTTGC TGATGCAGCA TCAGACATTG CCCGGGTCAG CAACCCACTC
CAAGCCGCCA GCGATATTGG CATGCTCGCC AAAGGTGGTG GCGCTGAAAC TGCCATGGCT
CGCTATATTG CCAATCAATC AAGCCATGTT GGTGCAGGCA CAGCCTTGAC CGATCGGGTG
GTTTTAGGCG CATTCAAGGC AGATCCGGCC TCGGGCTTCC TTGGCTATAT CGGTGAAGCC
AACGCTCATG GTGGCCGCTA TTTCAGCACC ACCAGCGATG TGTGGAATAA GCTCAAACCA
ATTGCCGAAG GTGGCAATAA TCGGATTTGG CCGGTCAATC GCGAATTCTT GCAAGCCCAG
CTTGAAAGTG GCATTAGCCG CATCGATATC AAAGGCAGCT CAATCGACGA TATTCTGACC
AAACGGCCTC AATCATATAG CGCCATGGAA GTGCGCTTTA TGCAAAGCCA AGCCTACCAA
TATGGCTATC GTCAAGTTGG CAATAGCTGG ATCAAGACTG GCGATTGGCG GGCCAGCACA
ACTGGTCGGG TCATTGGCGG CAGCATCGGA CCAGGTGGCG AGATTTTACA AACCAGCCAA
TCATTGAACG ATTAA
 
Protein sequence
MSKDVVQIQY EQTAQVAQHF GRLKQTVEQL QTTIRNVSQS LVDGDWQGDA SVAFAKELDG 
EIYPTFNRLI TAFQTAQEVT LEVQKIFGQA EEEAAALFKG ELFGNADGGS KGGGSWLDSV
GGFFSSVGNG IKDFFVGAGK ELKDMVVGVW NMVTSPIETA KGIWHAVTHP GEFWEAFKAP
YVEAWENGRP WEAIGRGTMF IGSLLIGTKG ADKVGKAAKI SKAATVADAA SDIARVSNPL
QAASDIGMLA KGGGAETAMA RYIANQSSHV GAGTALTDRV VLGAFKADPA SGFLGYIGEA
NAHGGRYFST TSDVWNKLKP IAEGGNNRIW PVNREFLQAQ LESGISRIDI KGSSIDDILT
KRPQSYSAME VRFMQSQAYQ YGYRQVGNSW IKTGDWRAST TGRVIGGSIG PGGEILQTSQ
SLND