Gene Haur_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0801 
Symbol 
ID5732701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp905439 
End bp906626 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content51% 
IMG OID641277932 
Producthypothetical protein 
Protein accessionYP_001543577 
Protein GI159897330 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGGC GACGAGCAGG CTTATTTGTT TTGGTCTTGC TCTTGATCAT TGGCTTGCTA 
AGCGATGTAA CGTTGTTGCC GTTGGTCGCG ATGCTGGGGA TTATGGCCTT GCTCGCGGTC
GAATTATGGG AATGGCGCAT GTTTAAGCAT GTTGATTATC AACGTGAACT AGGCCATACC
CATCTGTTCC CCGACAACCG CACAACCCTT AGCATTACCC TGCGCAATCG CAAATTTTTG
CCCTTGCCAT TTGTTAATTT GCACGATTTG GTTCCAGTAG GCATCACGCT TGAGCAGATT
GAAACCCAAC CTGCTGCTAG CCCCAATTAT CGGGTGCTAG CGCGGGCATT TGGCATCAGC
AGTTATCAGC AAGTAACCCG CCAATATACA ATTTTGTGCC CACAGCGCGG TTTGCACCGT
TTTGGCCCAG CCAATTTGAG TGCTAGCGAT CCGCTTGGGC TGAGTATTAG TCGCGCAACA
ATTAATGAGA TTGATCGCTT GATCGTCTAC CCTCGCTTGT TAACTGAGCC AGAATTAGGC
TTGCCGTTAC GCGAGTTATT GGGCACGATT CGCGCCTCGC AGCGCTTATT GACCGACCCT
GTTGTGCCGA TTGGTATCCG CGATTATACC CAAAGCGACC CGCTCAAAAG CATTCACTGG
ACGGCGACAG CGCGGCGCGG CCAATTACAA ACTCGCATTT ATGAGCCAGT TACGGCGCTG
ACCGTGATGT GTATTCTTGA TATCGAAACG ATTGTGCCAT CCTATCTTGG GGTGAATAAA
TTTCAGGGCG AACGTTTAAT TAGTATGGCG GCAACGGTTT GTAGCGCTTT ACACAAAGCT
GGCCATGCGA TTGGTTTATG GTCGAATGCC GCGCTGGTTG AGGGCAACAC GGCCATTCAG
CTGCCGCCCA ATCGTAGCCC CAAACAGGCC AGCGCCATCT TGGAAGTATT GGCCCAAATG
TCGCTCTACT CGCGGCTAGA AATTGCCAAA TTTATTGGGC GTGAACAATC ACGCTTACCG
CTAGGCGCGA CGGTTTTGCT GATTAGTGCG GTGGATACGC CGGCTCATCG CAGCGCCTTG
GCCCGTTTAC GCGAATATGG CTATGCCCCC GTTTGGCTCT ATTTAGGCCA GCATGCACCA
AAAGTTGCTG GAGTTAAGTT GATTCATAGT CACCAGAGGG AGCCATGA
 
Protein sequence
MKGRRAGLFV LVLLLIIGLL SDVTLLPLVA MLGIMALLAV ELWEWRMFKH VDYQRELGHT 
HLFPDNRTTL SITLRNRKFL PLPFVNLHDL VPVGITLEQI ETQPAASPNY RVLARAFGIS
SYQQVTRQYT ILCPQRGLHR FGPANLSASD PLGLSISRAT INEIDRLIVY PRLLTEPELG
LPLRELLGTI RASQRLLTDP VVPIGIRDYT QSDPLKSIHW TATARRGQLQ TRIYEPVTAL
TVMCILDIET IVPSYLGVNK FQGERLISMA ATVCSALHKA GHAIGLWSNA ALVEGNTAIQ
LPPNRSPKQA SAILEVLAQM SLYSRLEIAK FIGREQSRLP LGATVLLISA VDTPAHRSAL
ARLREYGYAP VWLYLGQHAP KVAGVKLIHS HQREP