Gene Haur_2030 details


Gene Information

Locus tag: Haur_2030
Symbol:
ID: 5733919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2525082
End bp: 2526734
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 50%
IMG OID: 641279174
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544801
Protein GI: 159898554
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
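
The listed coordinates and lengths can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Both are assumptions about this record's conventions, although they agree with the values shown. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2525082, 2526734)
print(length)                  # 1653, matching "Gene Length"
print(protein_length(length))  # 550, matching "Protein Length"
```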


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGATTG AGTTTAACCA GATGCTTGAA GTTGTGCCTC CATGCCAAAG CGAGCCAAAC 
CAAGCACACG CAACCAGCCA ATTGCACAGC GGTTTTCTAC AGAGCGTCCA GCGCTATCCA
AATAACATCG CTTTAACGAT TGGTAATAAA CAGTATGATT ATGTCAAATT ATATACAGTT
GCCCAACGCT GGGCTTTCGC GCTTCGCCAA TCAACCAAAC CGCTGCATCG TGTTGGTATT
TTTGCCTATC GTAGTGAAGC AGCCTACATC GGGATTTTGG CAAGCTTATT GGCTGGCGCA
ACCTTTGTGC CGCTGAACTA CAACTTTCCA CTTCAACGGA CCCAAGCGAT GATCGAGCAA
GCAGAGCTTG ATGCGATTAT TGTCGATCAC CAATCGTATG ACCAATTTTT GCAATTGGCC
GATTCGCTGC CAGTACTACC GCCATGTGTC CTCTTGCCTG ATTGTTTGCG TGCGCCGCTG
CTTGATACAA TGATCTATAC TCAAGCCGAG CTTGCTGAGC TACCGACTGA TCATGAACCA
GTTACTGTGC CGCCTGAGGC AATTGCCTAT CTGTTATTCA CTTCGGGTAG CACCGGCAAC
CCCAAAGGCG TACCAATTAG TCATGCCAAT GTCGCACACT TTCTCAAGGT AAATCAAGCA
CGATATCAGA TTACGCCTGC TGATCGGCTG AGCCAGACCT TTGATCAAAC CTTTGATCTG
GCCATCTTTG ATCTTTTTAT GGCTTGGAAT CATGGTGCGG CGGTCTGTGT TATCCAACCG
ATCCAATTGC TCTCACCTTT TCGCTTAATT GAAGAGCAGG GAATTACGAT TTGGTTTTCG
GTACCATCAG TTGCCGCGTT ACTGCGCAAA CAAAAACTAC TCAAGCCCAA TAGCTTGCCC
AACTTACGCT TAAGCCTTTT TTGTGGCGAA GCGTTGCCCA AAGCTACCGC TGAGGCTTGG
CAACTTGCTG CGCCCAACTC AATAATCGAC AATCTCTATG GTCCAACCGA ATTAACAATC
GCCTGTGCAG TGTATCGCTG GAATTCCCTC ACCTCGCCTG CTGAATGTTT GAATGAAGTG
GTCCCAATTG GTAAACTCTA CCCAGGTTTA ACCGCGGTGG TGGTTGACGC AAACGATAAT
CCTGTACCAG CAGGTACAGA AGGCGAATTG TGTGTTGCTG GCCCACAAAC CTTCCAAGGC
TATTGGCACA ACCCAAGCCT CACGGAGCAA CGGTTTCTGC GCAGCAAACA GCTTAATGGC
GAGGAACTCG GCTACTACCG CACCGGTGAT CGGGTTGTAT GCCGCACCAA TGGCAGCATG
ATTTACCTTG GGCGCAGCGA TCAACAAATT AAAGTCCATG GCTACCGGGT GGAATTAAGC
GAGATTGAAG GGGCGTTATT ACTCCAACCA GGCGTAGTTG CTGCGGTTGC ACTGGGCTGG
CCGCTTGAAA ACGGTTCGGC GAGCGGAATT GTCGCGTTTG TAATTGCGCC AAGCATTGCA
GTCAGTGATC TGCAACAGGC GGTTCAGCCA TTGCTCCCAA GCTATATGCT GCCGCGCACC
ATCTATCAGC TTGAAACCAT GCCGCTGAAT GCCAATGGCA AAATTGATCG GTTGGCCTTG
GCTCGCCACT TAGCAGGTGA AGGAACGGCC TAA
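
The gene sequence is displayed in blocks of 10 bases separated by spaces, so it needs to be joined before any computation. The sketch below is one possible way to do that in Python and to compute a GC fraction comparable to the "GC content" field. The helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGATGATTG AGTTTAACCA GATGCTTGAA GTTGTGCCTC CATGCCAAAG CGAGCCAAAC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Running the same functions on the full gene sequence should give a value close to the reported 50%, assuming that field is computed over the coding sequence alone.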
 
Protein sequence
MMIEFNQMLE VVPPCQSEPN QAHATSQLHS GFLQSVQRYP NNIALTIGNK QYDYVKLYTV 
AQRWAFALRQ STKPLHRVGI FAYRSEAAYI GILASLLAGA TFVPLNYNFP LQRTQAMIEQ
AELDAIIVDH QSYDQFLQLA DSLPVLPPCV LLPDCLRAPL LDTMIYTQAE LAELPTDHEP
VTVPPEAIAY LLFTSGSTGN PKGVPISHAN VAHFLKVNQA RYQITPADRL SQTFDQTFDL
AIFDLFMAWN HGAAVCVIQP IQLLSPFRLI EEQGITIWFS VPSVAALLRK QKLLKPNSLP
NLRLSLFCGE ALPKATAEAW QLAAPNSIID NLYGPTELTI ACAVYRWNSL TSPAECLNEV
VPIGKLYPGL TAVVVDANDN PVPAGTEGEL CVAGPQTFQG YWHNPSLTEQ RFLRSKQLNG
EELGYYRTGD RVVCRTNGSM IYLGRSDQQI KVHGYRVELS EIEGALLLQP GVVAAVALGW
PLENGSASGI VAFVIAPSIA VSDLQQAVQP LLPSYMLPRT IYQLETMPLN ANGKIDRLAL
ARHLAGEGTA
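
The protein sequence can be checked against the gene sequence by translating it. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons. The sketch below is a minimal pure-Python translation under that assumption. It does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, copied from the record.
print(translate("ATGATGATTG AGTTTAACCA GATGCTTGAA"))         # MMIEFNQMLE
print(translate("GCTCGCCACT TAGCAGGTGA AGGAACGGCC TAA"))     # ARHLAGEGTA
```

Both outputs match the first and last blocks of the protein sequence shown above.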