Gene Haur_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3163 
Symbol 
ID5735035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3993404 
End bp3995056 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content49% 
IMG OID641280306 
Productvon Willebrand factor type A 
Protein accessionYP_001545928 
Protein GI159899681 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.181499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGCT TCTTCCGTAG TTTTCTAATC GTTATTATGC TGGTGCTTGC AGCTTGTGGC 
GAGGCAGCCC AACCTACCAC CACCACTAAC CAAACCAGCG GGCCAGTTGT AGGCGCAAAT
GATCTACTGA TTAGCATCAC CTACAGCCCA GAGAAAACCA AATGGCTTGA AGAACGCATC
ACCACTTTCA ACAACCAAAA TGTGCAATCC AACGGCAAAC GGGTCATCGT TGAGGGCAAA
GAGCTTTCCT CAGGCACAGC CCGCACTCAA ATTCGCGCTG GCCAACTTCA GACAACAATC
TGGACACCTT CGGCCTCTAC CTGGCTGGAA GTGTTGAAGA AAGAAAGTAA TAATCCCACG
ATTGCCGAAG CCGATCCTCA ATCGTTAGTG CTCACACCAG TGGTTATTTC GATGTGGAAA
CCAATGGCCG AAGCCATGGG CTACCCGGGC AAAGAGGTAG GCTGGTCGGA TATGTTGGCG
CTGATTCAGG ATACCGAGGG CTGGGGCAAA TTCGGTCAAC AAGATTGGGG CCGCTTCTCG
TGGGGCCACA CCGACCCCGA CATTAGCACC ACCGCGCTTT CAACCGTGCT GGCTGAGTTG
TATGCAGCCA ACGGCAAAAC CAGCGATTTG ACGGTCGAAG ACATCAATCA AGAAAAAAGC
CAACAATTTT TGCGTGATTT AGCCCAAGGC ATCAAGCACT ATGGCTCAAA TACCTTGGTG
TTCAGCCAAA ACATGCAAAA ATATGGCATG GCCTATATTT CAGCCTTTCC GATGGAAGAA
ATTACCCTGA TTGATTTCAA CAAACAGGCT CCCAATGTCC CGTTAGTGGC AATCTATCCC
AAAGAAGGCA CGTTTATTCA CGATAATCCC TTTATTGTGA TGAGCGATGC AACTGCCGAC
CAAAAAGCTG CTGCCAGCGT TTTCTATGAT TTCTTGCTTA CGCCTGAAAG TCAAAATTTG
GCCATGCAGC AAGGTTTTCG GCCAGCGAAC GTTGATGTAG CGTTGGCATC ACCATTAACT
GCCCAATTCG GCGTAGATCC CAATCAACCA CGGAATTCGT TGGCAACCCC ACCAGCCGAT
GTGATTGTGG CTGCCAAAAA TGCTTGGGCT AATAATCGCA AGCCCGCCAA TATTATGTTG
GTGGTCGATA GCTCTGGCTC GATGCGCGAC GACGACAAGA TGGATCAAGC CAAACTTGGG
GTTGAGGTGT TTCTCAATCG CTTGCCAAGC AAAGATAACG TCGGCATGAT CGGCTTCTCA
TCAAGCCCAG CCGTGTTGGT GCCATTGGCA ACTCGTAGCG AAAACATGGC TAATTTGCAA
ATGCAAACCC AAGGACTCGT GCCCGATGGC AACACCTCGC TCTACGATGC GATCGATTTG
GCTCGTCAGG AATTGGAAAA CCTCAAACAA CCTGATCGGA TTAACGCGAT TGTGGTGCTA
AGCGATGGTG CTGATACGGC CAGCCAGCTT TCAATCGATC AAATGCTCGG TAATTTTGGC
GAATCGAGCA TTCAAATCTT CCCGATTGCC TATGGTGCTG ATGCTGAAAC TTCAATTTTG
CAACAAATTG CCGATTTCTC CCGCACCGAG TTGGTTCAAG GTAGCACTGG CGATATTGAT
AAAATCTTCG AAAATTTGAG CCGCTACTTC TAA
 
Protein sequence
MQRFFRSFLI VIMLVLAACG EAAQPTTTTN QTSGPVVGAN DLLISITYSP EKTKWLEERI 
TTFNNQNVQS NGKRVIVEGK ELSSGTARTQ IRAGQLQTTI WTPSASTWLE VLKKESNNPT
IAEADPQSLV LTPVVISMWK PMAEAMGYPG KEVGWSDMLA LIQDTEGWGK FGQQDWGRFS
WGHTDPDIST TALSTVLAEL YAANGKTSDL TVEDINQEKS QQFLRDLAQG IKHYGSNTLV
FSQNMQKYGM AYISAFPMEE ITLIDFNKQA PNVPLVAIYP KEGTFIHDNP FIVMSDATAD
QKAAASVFYD FLLTPESQNL AMQQGFRPAN VDVALASPLT AQFGVDPNQP RNSLATPPAD
VIVAAKNAWA NNRKPANIML VVDSSGSMRD DDKMDQAKLG VEVFLNRLPS KDNVGMIGFS
SSPAVLVPLA TRSENMANLQ MQTQGLVPDG NTSLYDAIDL ARQELENLKQ PDRINAIVVL
SDGADTASQL SIDQMLGNFG ESSIQIFPIA YGADAETSIL QQIADFSRTE LVQGSTGDID
KIFENLSRYF