Gene Hoch_3584 details


Gene Information

Locus tag: Hoch_3584
Symbol:
ID: 8545974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4937191
End bp: 4938468
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 63%
IMG OID: 646388253
Product: citrate synthase I
Protein accession: YP_003267979
Protein GI: 262196770
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01798] citrate synthase I (hexameric type)
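
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4937191, 4938468)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 1278 bp and 425 aa, which agrees with the Gene Length and Protein Length fields.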


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00887444
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGAATA AAGCGAAACT CATCATTGAA GGAAACGAGC ACGAACTGGA CATCATCGAG 
GGATCGGAAG GCGAGAAGGC CCTCGACATC CGCAAACTCC GGGCCGACAC GGGTTACATC
ACCATGGACT CGGGCTACGC CAACACCGGC GCGGCCGAGA GCTCGGTGAC CTACCTCGAC
GGTGAGCAGG GCGTGCTGCG CTACCGCGGC TATCCCATCG AGCAGCTGGC CGAGAACTCC
TCGTTCACCG AGGTCGCCTA CCTGGTCATC TACGGCAAGC TGCCGAACAA GACCGAGCTG
TCCGAATTCC GCGAGCTGCT CACCTACCAC AGCATGATCC ACGAGGGGAT GCGCCACTTC
TTCGAGGGCT TCCCGCCGTC GGGTCATCCG ATGTCCATCC TCTCGTCGAT GGTGTGCTCG
CTCTCGGCCT ACTACCCCGA CTGCCTCGAG ATCGACGCCG ATGACAACAT GCACGTGGCT
CGCGTGCTAT CGAAAGTACG CACCATCGCG GCCTTTGCGT ACAAGCACTT CATCGGCCAG
CCGATCATGT ACCCGCGCAA CGACCTCAAC TACTGCGCCA ACTTCCTGTA CATGATGTTC
GCCGTGCCGG CCGAGCCCTA CGAGCCCAGC CCCGAGGCGG TCAAGGCGCT CAACATGCTG
CTCATCCTGC ACGCCGACCA CGAGCAGAAC TGCTCGACCT CGACCGTGCG CCTCGTCGGC
TCGTCCAACG CCAACCTCTA CGCCTCGATC TCGGCCGGTA TACTGGCTCT CTGGGGCCCG
CTGCATGGCG GCGCCAACCA GGCCGTCATC GAGATGCTCG AGCAGATCCG CGACAAGGGT
GGCGACTACA AGGGCTTCAT GCAGCGCGTG AAGGACAAGG AAGAGCGCCT CATGGGCTTC
GGCCACCGGG TCTACAAAAA CTTCGACCCG CGCGCCAAGC TGCTGCGCAC GATGGCCGAC
GAGCTGCTCA CCCACCTCGG CATCCAGGAC CCGCTGCTCA ACATCGCCAA AGAGCTCGAG
CAGATCGCGC TGGCCGACGA GTACTTCATC GAGCGCAAGC TCTACCCCAA CGTCGACTTC
TACAGCGGCA TCGTCTACCG CGCCCTGGGC ATCCCGACCA ACATGTTCAC CGTGATGTTC
GCGCTCGGCC GCCTGCCGGG CTGGATCGCC CACTGGCGCG AGATGCACAA CGACCCGGGT
CGCCGCATCG GCCGCCCCCG CCAGGTCTAC GTCGGCGAGC AGAAACGCGA CTACGTGCCC
ATGGACCAGC GCAAGTAA
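
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The sketch below removes the whitespace and computes the GC fraction, which is one plausible way to arrive at a figure like the GC content field of the record. How the database itself computes and rounds that value is not stated, and the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, as an example input.
first_line = "ATGTCGAATA AAGCGAAACT CATCATTGAA GGAAACGAGC ACGAACTGGA CATCATCGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To check the full gene, paste all sequence lines into one string and pass it through `clean_sequence` first.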
 
Protein sequence
MSNKAKLIIE GNEHELDIIE GSEGEKALDI RKLRADTGYI TMDSGYANTG AAESSVTYLD 
GEQGVLRYRG YPIEQLAENS SFTEVAYLVI YGKLPNKTEL SEFRELLTYH SMIHEGMRHF
FEGFPPSGHP MSILSSMVCS LSAYYPDCLE IDADDNMHVA RVLSKVRTIA AFAYKHFIGQ
PIMYPRNDLN YCANFLYMMF AVPAEPYEPS PEAVKALNML LILHADHEQN CSTSTVRLVG
SSNANLYASI SAGILALWGP LHGGANQAVI EMLEQIRDKG GDYKGFMQRV KDKEERLMGF
GHRVYKNFDP RAKLLRTMAD ELLTHLGIQD PLLNIAKELE QIALADEYFI ERKLYPNVDF
YSGIVYRALG IPTNMFTVMF ALGRLPGWIA HWREMHNDPG RRIGRPRQVY VGEQKRDYVP
MDQRK
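
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal translator that assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits, which is not a concern here because this gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the blocked layout
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last six codons of the gene sequence above.
print(translate("ATGTCGAATA AAGCGAAACT CATCATTGAA"))
print(translate("ATGGACCAGC GCAAGTAA"))
```

The two example calls reproduce the first block (MSNKAKLIIE) and the final block (MDQRK) of the protein sequence.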