Gene Haur_1000 details

Gene Information

Locus tag: Haur_1000
Symbol:
ID: 5732903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1143673
End bp: 1145634
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 51%
IMG OID: 641278134
Product: acetate--CoA ligase
Protein accession: YP_001543776
Protein GI: 159897529
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02188] acetate--CoA ligase
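
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names `span_length` and `coding_codons` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1143673
END_BP = 1145634
GENE_LENGTH_BP = 1962
PROTEIN_LENGTH_AA = 653


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 1143673 to 1145634 covers 1962 bp, which is 654 codons, or 653 amino acids plus the stop codon.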


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0778524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAGC ATGCTGCACC AACCACTGTT GGTCAATTTT CGGCAAGCAA CGACGCTGAA 
TCAGCATTCT ACTACCCACC TGCTGCCTTG GTTGAAAACT CGAACGTGCT GGCCTACGCT
CGTGAAAAAG GTTTCGCCGA TGTCGAAGCC ATGCGTCAAT GGTCGATTGA GCATTATCAG
GATTTCTGGG CCGACATGGC AACTCGGATG GTCGATTGGT ATATGCCATT TAGCAAGGTG
CTCGACGATA GCAAAGCACC GTTCTACCAA TGGTTTAACG ATGGCAAGAT CAACATTGTG
CACAATGCGC TTGATCGCCA TGTCAAAACT TGGCGCAAGA ATAAGCTGGC CTTGATTTGG
GAAAGTGAAA AGGGCGATAA CAAGACCTAT AGCTATTGGC AATTGTTCAA ACGAGTCAAT
AAATTTGCCA ATGTGCTCAA ATCGATGGGG GTCAAAAAGG GCGATACCGT GACGATCTAT
TTGCCACGGG TTCCCGAAAT TGTGATTGCG ATGTTGGCTT GTGCCAAAAT TGGCGCGATG
CACAGCGTGG TCTATGGTGG TTTTAGTGTC GAAGCGCTGC AAACTCGCAT CCAAGATGCC
CAATCGCGGG TAGTGATTAC GGCTGACGGC GGTTATATGA ATGGCAAAGT CGTTGAGCTG
AAAAAGATCA CCGACGATGC GATTAAGCAT TCACCAGTGG TCGAAATTGT GATTGTAGTC
AAGCGCACAG GCCACGAAGT CGAAATGCAA CAAGGCCGCG ACTTGTGGTA TGAAGAATTG
ATGGCCTTGC CAATCGCTTC GACCAAGTGC GAAACCGAGC AACTTGATGC TGAACATCCT
TTATATATGC TCTACACTTC GGGCACGACT GGCGCTCCTA AAGGTTTAGT GCATACCCAT
GGTGGCTATC AAGTTGGGGT TGCGACGACC TTGCACTTTA ATCTTGATAT CAAAGAAGAT
GATGTGTATT GGTGTGCCGC CGATCCAGGC TGGGTCACGG GCCACTCCTA CATTGTCTAT
GGTCCCTTGA TGCTCGGCGC AACCCAAGTG ATGTATGAAG GCGCACCAAC CTTCCCCTTC
CCCAACCGCT GGTGGAATAT TGTCGAGCGC TATGGCGTAA CCGTGCTCTA CACTGCTCCC
ACCGCAATTC GCGGCTTGAT GCGCTTTGGC GAAGCCTGGC CTAATCGCCA TGATCTTGGC
TCGTTGCGCT TGCTTGGCTC AGTCGGCGAA CCAATTAACC CCGAAGCATG GCGCTGGTAT
CATCGGGTGA TTGGCCGCAA CAATTGCCCA ATTATGGATA CTTGGTGGCA AACTGAAACC
GGCAGCATGA TGATCACGCC TAATCCAACC ACGCCACTCA AGCCAGGCTC AGGCACTCGT
GCCTCGTTTG GCATCGATGC CGATGTGGTC AACGATCAGG GTGAGCATGC CAGCGATGAT
GAAGATGGTT TGTTGATTAT CCGCAATCCA TGGCCTTCGA TGTTGCGCAC GATTTATAAC
AACCCTGAGC GCTATATCGA ACAATATTGG AGCCGAATTC CAGGTGTGTA CACCGCTGGC
GATTCAGCCC GCAAAGACGA AGACGGCTAT TTCTGGGTAA TTGGTCGGAT CGACGATGTA
ATCAAGGTTT CAGGCTATCG GTTGGGCACG GCTGAAGTTG AGTCGGCTTT GGTTTCGCAT
CCATCAGTTG CCGAAGCTGC CGCAATCGGC TTGCCTCACG AAGTCAAAGG CAATGCGATT
CATACCTTTG TGATTTTGAA GAATGGCTAC GAAGCCAACC AAGATCTTGA GGATGCGCTG
ATTGCCCACG TTGGCAAAGT GATGGGGCCA ATTGCCCGCC CTGAGGCAGT ACAATTTGTG
CCAAGTTTGC CTAAAACCCG CTCAGGCAAA ATTATGCGCC GCGTCCTCAA AGCCCGTGCC
CTAGGCTTGC CCGAAGGCGA TTTGAGCACT TTGGAACAAT AA
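
The sequence above is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how one might strip the whitespace and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record (60 bases).
FIRST_LINE = "ATGACTGAGC ATGCTGCACC AACCACTGTT GGTCAATTTT CGGCAAGCAA CGACGCTGAA"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence through the same two functions would give a whole-gene value that can be compared with the GC content field of the record.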
 
Protein sequence
MTEHAAPTTV GQFSASNDAE SAFYYPPAAL VENSNVLAYA REKGFADVEA MRQWSIEHYQ 
DFWADMATRM VDWYMPFSKV LDDSKAPFYQ WFNDGKINIV HNALDRHVKT WRKNKLALIW
ESEKGDNKTY SYWQLFKRVN KFANVLKSMG VKKGDTVTIY LPRVPEIVIA MLACAKIGAM
HSVVYGGFSV EALQTRIQDA QSRVVITADG GYMNGKVVEL KKITDDAIKH SPVVEIVIVV
KRTGHEVEMQ QGRDLWYEEL MALPIASTKC ETEQLDAEHP LYMLYTSGTT GAPKGLVHTH
GGYQVGVATT LHFNLDIKED DVYWCAADPG WVTGHSYIVY GPLMLGATQV MYEGAPTFPF
PNRWWNIVER YGVTVLYTAP TAIRGLMRFG EAWPNRHDLG SLRLLGSVGE PINPEAWRWY
HRVIGRNNCP IMDTWWQTET GSMMITPNPT TPLKPGSGTR ASFGIDADVV NDQGEHASDD
EDGLLIIRNP WPSMLRTIYN NPERYIEQYW SRIPGVYTAG DSARKDEDGY FWVIGRIDDV
IKVSGYRLGT AEVESALVSH PSVAEAAAIG LPHEVKGNAI HTFVILKNGY EANQDLEDAL
IAHVGKVMGP IARPEAVQFV PSLPKTRSGK IMRRVLKARA LGLPEGDLST LEQ
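
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a plain-Python illustration that uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which this sketch does not model). The function name `translate` and the table construction are illustrative; an established library such as Biopython would be the usual choice in practice.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of all three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGACTGAGCATGCTGCACCAACCACTGTT"))
    print(translate("TTGGAACAATAA"))
```

The first call reproduces the opening residues of the protein sequence (MTEHAAPTTV), and the second reproduces its final three residues (LEQ) before the TAA stop codon.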