Gene Haur_3798 details


Gene Information

Locus tag: Haur_3798
Symbol:
ID: 5735662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4768124
End bp: 4769623
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 51%
IMG OID: 641280950
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001546562
Protein GI: 159900315
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [I] Lipid transport and metabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01923] O-succinylbenzoate-CoA ligase
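
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4768124, 4769623)
print(length, protein_length(length))  # prints "1500 499"
```

Under these assumptions the result agrees with the listed gene length of 1500 bp and protein length of 499 aa.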


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACATTG GTGACTGGCT TGGCAAGCGC GAGTTGCTTA CGCCAGAGCG ACTAGCGCTG 
GTTGACGACC GTGATGGCGA GCGCTATAGC TACCGCCAAT TGAATAGCCG TGCCAATCGT
TTGGCTGCGA GTTTGCGCCA ACGTTTTGGT GTAGGCAAAG GCGATCGAGT AGCAATTTTG
GCCAAAAATC AAATTGGCTA CCTCGATGCC TTATTTGCCA CTGGCAAGCT TGGGGCGATT
TTAGTGCCAC TCAATTGGCG CTTAACCGAG CATGAATTAA TTTATATGCT CAAAGATAGT
GCATCGAGCA TATTGCTCTA CGATAGCCAA TTTGCGCCGC TGCTGCCAAC CTTGCGCAGT
CAAACCCCAA TCAAGCAGTG TGTCCAGTTT GGGCCTGAAT ACGATCAACT GCTGACCCAA
GCTAGCGATT TGCCGATCAG CGAATCAGTT GATCTTGATG ATCCGCACTT GATTTTGTAT
ACCTCAGGCA CAACTGGCGC ACCCAAAGGC GCAGTACTTT CGCATCGGGT GCTGGTGTGG
AATTCGCTCA ATACCAATGT TGGCTGGGAT TTACACGCCG ATGACGTGAG CATTATTCAT
ACGCCACTAT TTCATACTGG CGGCCTGAAT GTGCTAACCC TGCCGATTTT GCATGCTGGT
GGCACAATGG TTTTGATGCA AGAATGGAAT CCCGAGCGCT GTTTGCAATT GATTGAGCAA
GAGCATGTGA CGATCTTTTT TGCCGTGCCA ACCATGTTTG AGATGCTGCT GCAAGCGCCC
AATTTTGTCC AAACCAACCT CAGCAGTCTG CGTTTTTGCA TCGCTGGCGG CTCGCCCTGC
CCAATTCCCT TGATCGAAGC CTATCAGCAG CGCAATATTC CGTTTCGCCA AGGCTATGGC
CTGACCGAAG TTTCGGTTAA TTGCTTCACG CTCAACCCAG AGGATGCAAT TCGTAAGGCT
GGATCGGTGG GCAAGCCGAT TTTTCACCTT GATGCCCGCA TTGTCGATGA GGCGGGCCGC
GATGTGCCGA CCAACAGCAT TGGCGAATTG ATTTTATATG GGCCGACGGT GTGCAATGGC
TACTGGCGCA ATCCGGTCGC AACCGCCCAA GCCCTGCAAA AAGGTTGGTT CTACACTGGC
GATCTAGCAC GGGTCGATGC TGAAGGTTAT TTCTACATCG TTGATCGCAA GAAGGATATG
TATATTTCTG GCGGCGAGAA TGTTTACCCT GCTGAGGTTG AAAACGTGCT CTATCAGCAC
CCTGCGGTAC AAGAATGCGC CGTGATTGGC ATACCCGATA GTCGCTGGGG CGAGGTTGGG
CGGGCTTTAG TGGTGTTGCG GCCAAGCACG CAGCTTGATG AGCCAACCCT GATCGCTTTT
TGTCGCGAAC GCCTGGCTAG CTACAAAACC CCAAAATCGA TTTATTTCTT GCCTGAGTTG
CCGCATAACG CCAGTGGCAA GGTCGTCAAG CCTGAGCTAC GCAAATTGTT TGGCTATTAG
 
Protein sequence
MYIGDWLGKR ELLTPERLAL VDDRDGERYS YRQLNSRANR LAASLRQRFG VGKGDRVAIL 
AKNQIGYLDA LFATGKLGAI LVPLNWRLTE HELIYMLKDS ASSILLYDSQ FAPLLPTLRS
QTPIKQCVQF GPEYDQLLTQ ASDLPISESV DLDDPHLILY TSGTTGAPKG AVLSHRVLVW
NSLNTNVGWD LHADDVSIIH TPLFHTGGLN VLTLPILHAG GTMVLMQEWN PERCLQLIEQ
EHVTIFFAVP TMFEMLLQAP NFVQTNLSSL RFCIAGGSPC PIPLIEAYQQ RNIPFRQGYG
LTEVSVNCFT LNPEDAIRKA GSVGKPIFHL DARIVDEAGR DVPTNSIGEL ILYGPTVCNG
YWRNPVATAQ ALQKGWFYTG DLARVDAEGY FYIVDRKKDM YISGGENVYP AEVENVLYQH
PAVQECAVIG IPDSRWGEVG RALVVLRPST QLDEPTLIAF CRERLASYKT PKSIYFLPEL
PHNASGKVVK PELRKLFGY
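
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, to compute GC content, and to translate the coding sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; alternative start codons, which table 11 permits, are not handled here, which does not matter for this gene because it begins with ATG. All names (`clean`, `gc_content`, `translate`) are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGTACATTG GTGACTGGCT TGGCAAGCGC GAGTTGCTTA CGCCAGAGCG ACTAGCGCTG"
print(translate(first_line))  # prints "MYIGDWLGKRELLTPERLAL"
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein length fields to be checked in the same way.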