Gene Hoch_4971 details

Gene Information

Locus tag: Hoch_4971
Symbol:
ID: 8547379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6851732
End bp: 6853402
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 72%
IMG OID: 646389645
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003269353
Protein GI: 262198144
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
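
The coordinate and length fields above are consistent with each other, and the relationship can be sketched in a few lines of Python. This is only an illustrative check, not part of the record: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(6851732, 6853402)
    print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1671 bp and protein length of 556 aa.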


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.212959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCT CGGGCAATAA TCTCGCCGGC CACCTGCTCA CGGCTGCACA AGAGGCTGGT 
CACGGGCCTC GTGTCGCCCT GCGACAGGGG GACGAGACCT GGACCTACGA CGCCCTGCGC
GAGCAGGTCA CGCGCGCGGC CGGTGCGCTC ACCGCGCTCG GTATCGGCCG CGGCGAGCGC
GTGGCCATCC TCATGCCCGA CTCGCTCGAC GCCGCCGCCG CGCTGCTCGG CATCATCTAC
GCGGGCGCGG TCGCGGTGCC GCTCGGGGAA CTCACCCGGG CGACCGAAAT CCGCGCGTAT
CTCGACCACT GCGGCGCCAA AGCCGCGATC GTCCACGCCG CGCAAGTCGC CGCGCTCGAC
GCGGTCCGCG GCGAGCTCGC GCAGCTCGAC GCCATCCTGT GCGTCGGCGG CGAGGCCGCC
CCGGGCATGC ACGACTACCG CGAACAGCTC GCCGCCACCC AGCCGCGCGA GCAGGCCGAG
GCCGTGGAGT CGAGCGACGC CGCCATGCTC GTGTATTCGA TCGCCGACGT CGAGAGCGAT
CTCCGCGGCG TCCCCCACAC CCACGGCACA CCGCGCTCGG CCTTTGCCTC CTTCGCCCAG
GGCGTCCTCG GCATCGGCCC CGACGACCGC GTGCTGTGCA TGGCGCAGCT ATCGACCGTG
TACGGGCTCG GCCTCGGCCT GTTCTTTCCC CTGGCCGCCG GCGCGCAGAA CCTGTTCGTG
GCCGAGCAGC CGAGCACCGA GGACATCGTC GCCGCGGTGC GCGACTTCGA GCCGACCGTG
GTCATGGCCG CGCCCTCCCT CTACCGCCAG CTCGTGCGCG ACGCCGAGAG CGACAGCGAC
GGCGAGCCCG CGCCGCTCCT GAGCGCCTGT CGCGCCTGCA TCGCGGGTAG CGAAGGCATG
CCGCCGCGGC TCATGGAGCG CGTGCGCGAG GTGCTGGGCG CGCCCATCAT GGTCGGCTTC
GGCCTCAGCG AGGCGTTTCA GTTCGTGCTC ATGGGCACCC CCGAAGACGC CCTGCAAGGC
GCCTGCGGAC GCCCGGTCGC CGGGTTCGAA GCCCGCCTGG TCGACGAAAA CGGCGAGCCG
GTCGGCCCCA GCGTCATCGG CACCCTGCAG ATCCGCGGGC CCACGCTGCT GTCCTCGTAC
TGGACCCCGG AGCACGAGGA AGAGGCCATG GGCCGACGCG CGCCCACCGA CCGCGATCGC
GCCGAGCCCG ACTGGGTCTG GCACCGGCGC ACCTGGCCCG AGGCCGCGTG GCCGGCCCAG
CACTGGGGCG ATGGCTGGTT CACCACCCGC GACCGCTTCC TGCGCGACGA GGGCGGCAAC
TTCTACCACT TCGGACGCAT CGACGATCTC TTCAAGGTCG GCGGCAAATG GATCTCGCCC
GCGGAGATGG AGCGCGCGCT CGGCGCCCAC GAAGCCGTGT GGGAGTGCGC CGTGGTCGGC
ACCGAAGACG AAGACGGGCT GACCAAGCCG ATGGCCTTCG TGGTGCCCAA CGTGGGCCAC
ACCGCCGGCC CCGAGCTGGC GCGCACCCTG CGCGACTACC TCAAGTCCGA GCTGGCGTCC
TACAAATACC CGCGCTGGCT CGAGTTCGTC GAACACCTGC CGCGCGGCCC GCAGGGCAAG
ATCCTGCGCT TCAAGCTGCT CGCGCGCAGC AAGATGCCGC GCAAGCAGTA G
 
Protein sequence
MPTSGNNLAG HLLTAAQEAG HGPRVALRQG DETWTYDALR EQVTRAAGAL TALGIGRGER 
VAILMPDSLD AAAALLGIIY AGAVAVPLGE LTRATEIRAY LDHCGAKAAI VHAAQVAALD
AVRGELAQLD AILCVGGEAA PGMHDYREQL AATQPREQAE AVESSDAAML VYSIADVESD
LRGVPHTHGT PRSAFASFAQ GVLGIGPDDR VLCMAQLSTV YGLGLGLFFP LAAGAQNLFV
AEQPSTEDIV AAVRDFEPTV VMAAPSLYRQ LVRDAESDSD GEPAPLLSAC RACIAGSEGM
PPRLMERVRE VLGAPIMVGF GLSEAFQFVL MGTPEDALQG ACGRPVAGFE ARLVDENGEP
VGPSVIGTLQ IRGPTLLSSY WTPEHEEEAM GRRAPTDRDR AEPDWVWHRR TWPEAAWPAQ
HWGDGWFTTR DRFLRDEGGN FYHFGRIDDL FKVGGKWISP AEMERALGAH EAVWECAVVG
TEDEDGLTKP MAFVVPNVGH TAGPELARTL RDYLKSELAS YKYPRWLEFV EHLPRGPQGK
ILRFKLLARS KMPRKQ
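
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how one might do that and recompute a GC fraction like the one reported under Gene Information. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (A/C/G/T only, assumed)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First printed line of the gene sequence, pasted as it appears in the record.
    first_line = (
        "ATGCCGACCT CGGGCAATAA TCTCGCCGGC "
        "CACCTGCTCA CGGCTGCACA AGAGGCTGGT"
    )
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.2%}")
```

Pasting the full gene sequence into `gc_fraction` in place of `first_line` gives the fraction for the whole gene, which can then be compared with the rounded percentage in the record.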