Gene Huta_2539 details

Gene Information

Locus tag: Huta_2539
Symbol:
ID: 8384844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2605293
End bp: 2606417
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 69%
IMG OID: 644973616
Product: Citrate (Si)-synthase
Protein accession: YP_003131436
Protein GI: 257053603
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID:
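
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, which is an assumption here because the record does not state its coordinate convention. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(2605293, 2606417)
print(length)                  # 1125
print(protein_length(length))  # 374
```

Both results match the Gene Length and Protein Length fields of this record.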


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.64966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGACG AGGAGATACA CCGCGGACTG GCGGACGTGA CGGTCACCGA AACGCGGCTG 
AGCGACATCG ACGGCGAGGC CGGCCAGCTG TGGATCGCGG GGTACCCAGT CGCAGACCTG
GCGGCGAACG CGACCTACCC CGAGACAGTC TATCTGCTGT TGCACGATCG CCTGCCGGAC
GCCGAGGAAC TGCAATCCTT CGAGGATCGG CTGTGTTCGT ACCGGACGCT GCCGGAACCC
TGCCACGATG CCGTCGTCGC GGCCGCCCAG CGCGGGGCCG GGCCGATGGC CGCCCTCCGG
ATGGGCGCGG CGACAGCCAC GGCGGTCGAG CCGAACGATC CCGAGGCCGA CGCCCTCCGG
TTGATCGCCC GGCTGCCGAC GATCACGGCG ACCTACTGGC GCGTGCTCCA GGGCCAGGAA
CCGCTCGAAC CACGGCTCGA TCTCGGCCAC GCCGCCAACT ATCTCTACAT GCTGACCGGC
GAGGAGCCGA CCGATGCCCA GGTCGCGGGC CTGGAGACGT ACCTCTCTAC CGTCGTCGAT
CACGGCCTCA ACGCTTCGAC GTTCACCGCG CGAACGATCG TCTCGACGGA GTCCGAGCTG
GTCTCGGCGA TCACCGGGGC GATCGGCGCG CTGCGGGGGG ACCTCCACGG CGGTGCGCCG
GACCTGGTTC TGGAGATGCT CGAATCGCTA GAGGAGAGCG AGGATGTCCG CGGCGAACTC
GGGGCGCGGC TCGAAGCCGG GGAACGACTG ATGGGCTTTG GCCACCGGGT GTACGGCGCG
CGCGACCCGC GAGCGGCAGT CTTAGAGGAC GCCGCCGCGT CATTTTACGA GGGTGAGGAC
GATTTCTTCG CCGCGGCCAA AGCAATCGAG GACGTCGCGA CCGACCTCCT GGCCGAGCAC
CGCCCCGACC TGGACCTGGA GACGAACGTC GAGTTCTACA CCGCCGTCCT GCTCCACGGT
GTCGGGATTC CGCCGGAACT GTTCACGCCG ACGTTCGCGA TCTCGCGGGT CGCCGGCTGG
AGCGCGCACT GTCTCGAACA ACTCGAGGAC AACCGGCTGA TCCGCCCGCG GAGCGAATTC
GTCGGCGAGC ACGACCGCGG GTGGGTGCCG CTCGACGAGC GATAA
 
Protein sequence
MSDEEIHRGL ADVTVTETRL SDIDGEAGQL WIAGYPVADL AANATYPETV YLLLHDRLPD 
AEELQSFEDR LCSYRTLPEP CHDAVVAAAQ RGAGPMAALR MGAATATAVE PNDPEADALR
LIARLPTITA TYWRVLQGQE PLEPRLDLGH AANYLYMLTG EEPTDAQVAG LETYLSTVVD
HGLNASTFTA RTIVSTESEL VSAITGAIGA LRGDLHGGAP DLVLEMLESL EESEDVRGEL
GARLEAGERL MGFGHRVYGA RDPRAAVLED AAASFYEGED DFFAAAKAIE DVATDLLAEH
RPDLDLETNV EFYTAVLLHG VGIPPELFTP TFAISRVAGW SAHCLEQLED NRLIRPRSEF
VGEHDRGWVP LDER
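
Both sequences above are printed in space-separated groups of 10 residues, so they need to be joined before any downstream use. The Python sketch below, with hypothetical helper names, removes the grouping and computes a GC fraction for a DNA string. It uses only the first line of the gene sequence as a short example, so the fraction it prints describes that line alone and not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above, used only as a short example.
first_line = "ATGAGTGACG AGGAGATACA CCGCGGACTG GCGGACGTGA CGGTCACCGA AACGCGGCTG"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Pasting the full gene sequence in place of `first_line` gives a value that can be compared with the GC content field of the record.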