Gene Haur_4186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4186 
Symbol 
ID5736048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5338761 
End bp5341247 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content54% 
IMG OID641281341 
Productvon Willebrand factor type A 
Protein accessionYP_001546946 
Protein GI159900699 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTTT TGGGATTATT CCTGGTTATA GGATTGAGTG GCTGGTGGAT TGCCCGCAAA 
ACAACCCTCG ATTGGCGGGT AGTTGGGCTG CGCTTGACGA GCTTAGCCTG TATTTTGCTG
GCACTGGCCT TACCGCGCAA CCAAGCTAAT CAACAAGCTA GCCCTTTGAT TTTGCTGGTT
GATCAATCGG CCAACTTGCC TAGCGAGTTG CGTGATGCCG CTTGGAATGA GGCTGTGCGC
TTCTATCAAC AACAAATCGA GCAACGTCCA GTGCGCTTAT TGGCCTTTGG GGCCGATGTG
CGGGTGAGCC AAACCGACCA ACGTCCAGCG ATTGACCCCA ATGGCAGCGA TTTGGCGGGA
GCATTGCAAT TTGCTAGTGG TTTGTTGCCG CAAGGTGGCG ATATTATTCT GCTCAGCGAT
GGTGCTAGCA CTACCACCAA TGGGCAAAAT CAGGTTAGTA CATTTGCGCA GCGCTCAATT
CGTTTGCATG GCGTGCCCAT CAGCTACCCC GAAACCGATA TTCGGGTGGA ATCGCTGCTT
GTGCCGCCAG CTTTGCGCGA AGGCGAGCGA TTTAGTGCCG ATGTGGTGCT CTATTCGAGT
GTTGATGGTC AAGTGCGCCT CGAATTGAGT AGCGACGGCG TGGGCTTGGC CGGTCAAACA
ATTAATGTTG AACAAGGCCG CAATCTGGTT TCATTTCAAT CGACCGCTGG TGCTCGCGGC
TTCCATCGTT TTCAGGCAAC ATTGCTAGCC ACCAACGATC AACAACCTGC CAACAATCAA
CTTGATGCCT GGACGGTGGT TGGGCCACCG CCGCGAGTAT TGATCATCGA ACGCTCACCA
GATAGCTCAG CCAACTTGCG CGATGCCTTA GAAGCTGCTA ATTTGGTGAC CGAAGCCTTA
CGCCCTGCCG CCTTGCCGAC CAGCCTCAGC CAACTCAGGG TCTACGATTC AATCGTGCTC
CAAGATATTT CTGCCAACGA TTTAAGCCTT GATCAGCAAT TGGCCTTGCG TGAATTTGTG
CGCAGCCTTG GCCATGGTGT GGTTGTATTA GGTGGAACCA ATAGCTATAA CTTGGGCAGT
TATGCTGGCA CGCCGCTCGA AGAATTGTTG CCAGTTTCAA TGGAGCCGCC GCCCCGCCGT
GAGCGCCCAA CCGTCACTCT GCTGCTGATT CTGGATCGCT CGGCAAGTAT GTTGGGCGAG
TCGGGCAAAG ATAAATTTAG CCTTGCCAAA GCTGCCGCGA TTGCCGCAAC CGATTCTTTG
GGAGCCGATG ATACGATTGG CGTGCTGGCA TTCGATGATA CCAACGATTG GACAGTGACC
TTTACCAAGG TTGGTCAAGG TGTGCAACTA AGCGAAATTC AAAATAATAT CGCTGGCTTG
AGTGCTGGCG GTGGAACTGA TATTTATGCC GCTTTGGAAG TTGGGATGGG CGGTCTGGCT
CAACAAACTG GCAAAGTGCG TCATGCCGTG CTGTTGACAG ATGGACGTTC TGGCGGCGAA
AGCTCCTATG AATCGCTGAT CGCTCCGTTA CGTGCCCAAG GCATTACGCT TTCGACAATT
GCGATCGGCG GCGATGCTGA TACCGTGCTG CTCGAATCGT TGGCCAAATT GGGTGCGGGA
CGCTATCATT TTGCCTCTAG ACCCGATGAT TTGCCGCGAT TGACCTTGCA AGAAGCCGAA
ATTGCCCGCG AAAATCCATT AACTGAGGGC CAATTTCAGG CTAATCTTGC TACGCCGCAC
CCCGCGATTC GTGGCCTGAA CCTCGGCGAA ATCCCGCCGT TTGGTGGTTA TGTCGCGGTT
ACGCCCAAAC CTGAAGCTGA GCAATTATTG ACCACTACCG AAGGCGATAT TTTGCTGGCA
ACTTGGCAAT ATGGGCTTGG TCGCGCCACT GCTTTTACCT CGGATAGCGG CGAACGTTGG
ACTGCCACAT GGCGACCTTG GCCAAATTGG GGCAATACCC TGGCGCAAAT TATCGCCGCA
ACTTACCCCA ACCCCGCCCG GGGCGACCTC CGAGTCAGCA GCGAATTGCA ACAGAATCAA
GCAATTATCA CTCTCGATGC GCAAGCTGAA ACGGGCGAAC TCTACGATTT GGCTGATGTA
GGCTTGCGGG TGCTGGCTCC CAATGGCAGC GAACAAATCT TGCGTGCACC CCAAATTGCG
CCAGGTCGTT ATCAAGCGCT GGCTGATGCC TCCCAAACTG GCGCGTACCA TATTTTGGCA
GCGTTGGAGC AAGGCCCAAA TCGGCTCGAA ACCCAAGCTG GCGTGATTCA TCCCTACAAT
CGTGAATGGG CGGTTTCGGC TAACCCCGCA CTGTTAGAGC AATTGGTCGG GCTTGGGCAA
GGCCAAATCG GCAGCTTGGA GCAAATTGCC CCCAGCCTGC AAGTTGCCAA CCAAACCAGC
AATACCCAAT GGTGGCCATG GCTGATTGCG CTTGCCTTAG GCTTATGGGT GGTTGAAATT
GCCATCCGCC GTGGAGTTAT TCGCTGA
 
Protein sequence
MIFLGLFLVI GLSGWWIARK TTLDWRVVGL RLTSLACILL ALALPRNQAN QQASPLILLV 
DQSANLPSEL RDAAWNEAVR FYQQQIEQRP VRLLAFGADV RVSQTDQRPA IDPNGSDLAG
ALQFASGLLP QGGDIILLSD GASTTTNGQN QVSTFAQRSI RLHGVPISYP ETDIRVESLL
VPPALREGER FSADVVLYSS VDGQVRLELS SDGVGLAGQT INVEQGRNLV SFQSTAGARG
FHRFQATLLA TNDQQPANNQ LDAWTVVGPP PRVLIIERSP DSSANLRDAL EAANLVTEAL
RPAALPTSLS QLRVYDSIVL QDISANDLSL DQQLALREFV RSLGHGVVVL GGTNSYNLGS
YAGTPLEELL PVSMEPPPRR ERPTVTLLLI LDRSASMLGE SGKDKFSLAK AAAIAATDSL
GADDTIGVLA FDDTNDWTVT FTKVGQGVQL SEIQNNIAGL SAGGGTDIYA ALEVGMGGLA
QQTGKVRHAV LLTDGRSGGE SSYESLIAPL RAQGITLSTI AIGGDADTVL LESLAKLGAG
RYHFASRPDD LPRLTLQEAE IARENPLTEG QFQANLATPH PAIRGLNLGE IPPFGGYVAV
TPKPEAEQLL TTTEGDILLA TWQYGLGRAT AFTSDSGERW TATWRPWPNW GNTLAQIIAA
TYPNPARGDL RVSSELQQNQ AIITLDAQAE TGELYDLADV GLRVLAPNGS EQILRAPQIA
PGRYQALADA SQTGAYHILA ALEQGPNRLE TQAGVIHPYN REWAVSANPA LLEQLVGLGQ
GQIGSLEQIA PSLQVANQTS NTQWWPWLIA LALGLWVVEI AIRRGVIR