Gene Haur_5099 details

Gene Information

Locus tag: Haur_5099
Symbol:
ID: 5737057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 126357
End bp: 128429
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 57%
IMG OID: 641282264
Product: hypothetical protein
Protein accession: YP_001547855
Protein GI: 159901609
COG category:
COG ID:
TIGRFAM ID:
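
The reported gene and protein lengths follow from the start and end coordinates above. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the function names are hypothetical, chosen for illustration):

```python
# Values copied from the record above.
START_BP = 126357
END_BP = 128429


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2073 690
```

Both values agree with the Gene Length and Protein Length fields of the record.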


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCGCA TCCGTATCGT CGCGTTGGTG TTCATTGTCG TGCTCCTTGG ATGGTTCGCT 
CCACTTTATG CCCAACAATC AAATGATGTC ATCATTCAAC CGCATGCCTT CGTTGGTGAC
ACGTTTACCT ACCAAGGTCT CCTTATGCAA GGAACCACCT ATCCCAGCGG AACCTTTGAT
TTCCAATTTA GCCTCTATGA TGATCCGACC GCAGGTACGC TGCTTGGGCA GCTTCAACAG
GACGACGTTC CCGTTGAAGC TGGCCAATTT ACCGTCGCCT TGACCTTCCC CGAAGGCAGC
GTTACCGGTC ATCAACGCTG GTTGGCGATT GCCGTCAAAA CCCTGAATGG CAGCGCCTAT
GTTCCCTTGA ATCCACGCTC CGCCGTCAGC GCAGCGCCGA TCGCCCTCAG TCTGCCCGGC
CTCTGGACAC GGCAAAACGA TACCAGCCCC AACCTGATTG GGGGCTATAG CAGCAATACC
GTTCCAGCGA ATGGCGTTGG GATGACGATT GGTGGCGGGG GAGCCTTTGG GAATCTTCAA
CAAATCTATG ATCACTATGG CGTGATTGGC GGTGGGGCTA ATAATCGCGT GGGCAGCGAT
GATGGCACTG TCACCAACGA CGGCTATGCC ACGATTAGCG GTGGGTTTGG TAACACGACG
ACCCAAGAAT ATACCGTGAT TGGCGGTGGG CAGGCGAATA CCATCACGGG CGCATTCTCG
ACCATTAGCG GTGGGACGAC CAATACGATT GCGCATATCT ACGCGACCAT TGGCGGCGGG
ATGAACAATC GCGTTTCTGC CCAATTTGGC ACGATTGGTG GCGGTGGCAG TTCGGCCAGT
GCGACTGGCA ACCGCGTCTA TGATACCTAT AGTACGATCA GTGGCGGCTA TAACAATGTT
GCGGGAGTCG ATGACACGGG CAACCAACCA TTTGCGACGG TCGGCGGTGG GTCGAGTAAT
AACGCGAATG CGCTTGGGAG TATGGTTGGT GGCGGTCGTT CAAATAGCAT CAGTGCCATC
GCTGACTACA GTGTGATTAG TGGCGGCTAT AACAATGTTG CCACAGGATT GTATGCGACG
GTGAGTGGGG GAGGAAGTGC CTCAAGTGGC CAAGGAAATC GCGCCTACGA CAACTACAGT
ACCGTTGCTG GGGGCTATAA CAATGTTGCA GGGATCGATG ATTCAATCGG GCAACCTTTT
ACCACGGTTG CTGGTGGTGG CTCGAATACG GCCAGCGGTT ATGCGAGCGC AATTGGGGGT
GGGCGCTTGA ATCAGGCGAG TGGTCAGTAT GCCTTTATTG GGGGTGGTGA ATCGAATACC
GCTACGGGAG ATCACACGAC GATCGGCGCA GGCCGACAGA ACACGGCCAA CGGCAACTTT
TCGTCGATTC TCGGTGGGAG TGGCAATAGT ACGTCTGCTG ACTATAGTGT AGCCGCCGGG
GAAAATGCGG TTGCCGCCCA TCGTGGCAGT TTTGTCTGGG CCAGTACCCA AGCCGCGCCG
GATGCCACGA TCACCAGTAC CGCCCCGGGC CAATTTATTG TGCGTGCCCC CGGTGGGGCG
TGGTTTGGCA GCAGCACGCA GGTGGACATG CCGAATGGAG CCATCCTTGC GACGGATAGC
GGAGCCTTCC TCAGCAAGGG GGGAACGTGG TCAAATTCGT CGGACAAACA TCGCAAAACC
CAGTTTGCGG CGATTGATCC TCATGCCCTC CTTACCAAAC TGGCAGCGAT CCCGATGCAG
TCGTGGAGCT ACATCAATGA AGATCCGCAG ATTCGCCACC TTGGCCCGAC GGCGCAAGAT
TTTTATGCAG CCTTTGGTTT GGGCACGGAC GATCGGCATA TTGCGACCGT CGATGCGGAT
GGAGTTGCCT TGACCGCAAT CCAAGGACTG TATCAGCTGA ATCGGGAGCA AGCGGCAGTG
ATCACCGATC TCGAAACCCG CTTAGCCGCG CTGGAAACAG CCACCCCCTC GCCAGCGCGT
TCTGTATGGC TGCTCGCTGG TGGGTGGGGC AGTCTGCTGC TGATCGTTGG CTGGCTCGTT
GGCCGCCGGA TGCGACGTGG AGGGACGGTA TGA
 
Protein sequence
MGRIRIVALV FIVVLLGWFA PLYAQQSNDV IIQPHAFVGD TFTYQGLLMQ GTTYPSGTFD 
FQFSLYDDPT AGTLLGQLQQ DDVPVEAGQF TVALTFPEGS VTGHQRWLAI AVKTLNGSAY
VPLNPRSAVS AAPIALSLPG LWTRQNDTSP NLIGGYSSNT VPANGVGMTI GGGGAFGNLQ
QIYDHYGVIG GGANNRVGSD DGTVTNDGYA TISGGFGNTT TQEYTVIGGG QANTITGAFS
TISGGTTNTI AHIYATIGGG MNNRVSAQFG TIGGGGSSAS ATGNRVYDTY STISGGYNNV
AGVDDTGNQP FATVGGGSSN NANALGSMVG GGRSNSISAI ADYSVISGGY NNVATGLYAT
VSGGGSASSG QGNRAYDNYS TVAGGYNNVA GIDDSIGQPF TTVAGGGSNT ASGYASAIGG
GRLNQASGQY AFIGGGESNT ATGDHTTIGA GRQNTANGNF SSILGGSGNS TSADYSVAAG
ENAVAAHRGS FVWASTQAAP DATITSTAPG QFIVRAPGGA WFGSSTQVDM PNGAILATDS
GAFLSKGGTW SNSSDKHRKT QFAAIDPHAL LTKLAAIPMQ SWSYINEDPQ IRHLGPTAQD
FYAAFGLGTD DRHIATVDAD GVALTAIQGL YQLNREQAAV ITDLETRLAA LETATPSPAR
SVWLLAGGWG SLLLIVGWLV GRRMRRGGTV
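
The gene and protein sequences above can be checked against each other. The sketch below is one way to do that in plain Python, not a definitive implementation: it strips the 10-character grouping, translates in frame, and computes a GC fraction. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last rows of the gene sequence are copied in for brevity.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGGTCGCA TCCGTATCGT CGCGTTGGTG TTCATTGTCG TGCTCCTTGG ATGGTTCGCT"
last_row = "GGCCGCCGGA TGCGACGTGG AGGGACGGTA TGA"

print(translate(first_row))  # MGRIRIVALVFIVVLLGWFA
print(translate(last_row))   # GRRMRRGGTV*
```

Both translations match the corresponding ends of the protein sequence. The last row stays in frame because the 34 rows before it contribute 2040 bases, a multiple of 3. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.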