Gene Haur_5250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5250 
Symbol 
ID5737208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp20302 
End bp21900 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content63% 
IMG OID641282414 
Producthypothetical protein 
Protein accessionYP_001548005 
Protein GI159901760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.879154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCC GTTGGCTTGG GGTTGGTATG CTCGTGTTCT TGGCAGGCTG TAGCCGCCTT 
GCGCCAAGCC AGCTTGCCGG AACCGCAGCG GTCTTGGGCT GTTGGCCCTA TGGGTTTGAG
CCGCCGCCAC CGACGGCGAC CAATCCGGTG ATGCTGTCAC CACCGCCGAG CGGCACGGGA
ACGCCATTGC CCACGTTGAC CGCTAGCCCG ACCGCCGCGC CAACCATGCC TGTCTGTACC
CCCGCCCCGA ACACACCAAC CCTGACCCCC AGTCCTACGC CCTCGCCAAC CCCATGGACA
CGACCCACCG CCGCCCCACC TGGGGGCAAA GGCATGCCGC CGCTGAATCT CTCGAACATG
CCAGGCTACG ACCAAGACCC CGCGATTGCC GTCCACCCCA CGCAGGGCTG GGCCGCTGTG
GTCTGGTCAA ATTGGCTGAA TGAGTTTCCG CAGGAGGCGA CGGTGCTGGT CAAGGTGCAA
GACCCACAGA CCAAAACATG GCGGCAGGGG ATTGGGGTGA ACACGGCAAC CGTCACCAAA
GGCGCAGGCG CACCCGCAAT TGCCATCGAT GCCCAAGGTC GCATCCATGT GGTCTTTGCG
CAAAATGGGC AGGTGGTGAT CACGAGCAGC AGTGACGCGG GCCGAACGTG GACACCGCCC
GAACCCATCC CACTGCCCAG TGGCAGTCAG GGCGGGCGGA TGTTTCAGGT GGCAGTTGAT
GCAGTGGGCC AGCTGCATGT GTTTTTTATC AGTGCGGATG CCTGTTTTGA TTGCTTTCAC
GCGGTGCATG CCCAACGCGC CAGCGATGGC AGCGGGCCAT GGGTATGGCA GGATTGGCTG
ATCAATGACT CCAAACAACT TTATGGTGAC ATCGCCACCG TGCCGTTGGC CAATGGCACG
ATCCGCACGG TCGTGGCGAT TGGGGTGGGC GATGGGGTGC GCATCGTGAC GCAGGACGGA
CGCAATGGCC CATGGGTGGC GCGACCGTTG TCATTTGGCG GGTTGCCAAT CCAGCCGCAG
GTCGTGGCAT GGATTGACCT GGTGGCCTTT ACCGATCAGG CGGGTCAGGC CCAGGTCTGT
GTCAGTTGGG GCCAATATAG CAAAAGCGGG GTGTTTGTCG CCTGCTCGCG CGACGGTGGC
CAAACGTGGG ATGTGCCGGA GATTCTGGCC ACCCACGCGG CACCAGGAGC CGCGCCAACG
CCCGATCCCG CCGCGCCAAC GCCCGCCTTG GAAGACAATC CCAGTCCCAG TGAGGGCAGC
GGCCAGCGCG GCTTTCACCC CGAACTCTTG TATGAACCCG CGACGGATAG CCTGATGGCC
GTGTGGAGTC TGCTCGATGG CAGCGCCTCA ACCATTGTCT ACAGCTATCG ACCTGCCCAG
GGCGGGGCAT GGTTGCCCGT GATGAATACC ATCACCACGG AACCCGCATT GGGCGTGTTT
GGCGCAACCC GCCGCAGTGC CGCCCGCAAT CCGCGCTTAG CCTTTGCTGG ACAGGGCGTG
GCGATGGTCG CGTGGATGGA AGTGGAGCGC GATGAAAACC TTGAGGTCTA TGTCGGCGGC
TTTCTGCCCG CCACCCTCTT AACCCGCGCC GAGAACTAA
 
Protein sequence
MSRRWLGVGM LVFLAGCSRL APSQLAGTAA VLGCWPYGFE PPPPTATNPV MLSPPPSGTG 
TPLPTLTASP TAAPTMPVCT PAPNTPTLTP SPTPSPTPWT RPTAAPPGGK GMPPLNLSNM
PGYDQDPAIA VHPTQGWAAV VWSNWLNEFP QEATVLVKVQ DPQTKTWRQG IGVNTATVTK
GAGAPAIAID AQGRIHVVFA QNGQVVITSS SDAGRTWTPP EPIPLPSGSQ GGRMFQVAVD
AVGQLHVFFI SADACFDCFH AVHAQRASDG SGPWVWQDWL INDSKQLYGD IATVPLANGT
IRTVVAIGVG DGVRIVTQDG RNGPWVARPL SFGGLPIQPQ VVAWIDLVAF TDQAGQAQVC
VSWGQYSKSG VFVACSRDGG QTWDVPEILA THAAPGAAPT PDPAAPTPAL EDNPSPSEGS
GQRGFHPELL YEPATDSLMA VWSLLDGSAS TIVYSYRPAQ GGAWLPVMNT ITTEPALGVF
GATRRSAARN PRLAFAGQGV AMVAWMEVER DENLEVYVGG FLPATLLTRA EN