Gene Huta_1304 details


Gene Information

Locus tag: Huta_1304
Symbol: cofG
ID: 8383581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1276479
End bp: 1277585
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 644972365
Product: FO synthase subunit 1
Protein accession: YP_003130213
Protein GI: 257052380
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit
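
The gene and protength fields in this record follow from the start and end coordinates. As a minimal sketch (the variable names are illustrative, and the coordinates are assumed to be 1-based and inclusive, which is the usual convention for records like this one), the arithmetic can be checked as follows:

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 1276479
end_bp = 1277585

# Inclusive span: both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# The final codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1107 0 368
```

The result agrees with the 1107 bp and 368 aa listed in the record.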


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.366684
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCCGAA ACCCGGACGT GATCCCCGGG GCCGAGGAAT ACGACGTCGA CGTCACGATC 
GATCCGGCCG AGCGCGAGCG ACTGCTGTCG GTCGGCCCCG AGGACGTCGC CGGACCCGGC
GAGGACGGCG GCCCCGACCA CCTCTCCTTT GCCAGAAATG TCTTCATCCC ATTGACGACG
GCCTGCCGGT ACACCTGCAC CTACTGTACG TACTACGATC CGCCGGGCCA GGCCTCGTTG
CTTTCGCCCG AAGACGTCCG CGAGATCTGC CGGGAGGGGG CCGACGCCGG CTGTACGGAA
GCCCTCTTTA CCTTCGGCGA CGATCCCGAC GACCGCTACG ACGCAATCTA CGACCAACTC
GCCGAGTGGG GCCACGACTC GATTCACACC TATCTCCGGG AGGCCTGCGA GATCGCGCTG
GAGGAGGGAC TGCTGCCCCA CGCCAATCCG GGCGATCAGA CCCGCGAGCA GATGGCCGAA
GTCGCCGATC TGAACGCGAG CATGGGTGTG ATGCTAGAGA CAACCGCCGA TCTTGAGGCC
CACTCGGGTT CGCGCCGCAA AGAGCCGGGC CAACGACTCG CAACGATCCG GACGGCAGGG
GAACTCGGCG TGCCTTTCAC AACCGGGATT CTGGTCGGCA TCGGCGAGGA CTGGGCGGAT
CGCGCCGAGA GCCTGCTGGC AATCGCTGCC CTCCACGAGC GGTACAACCA CGTCCAGGAG
GTGATCGTCC AGCCCGTTTC GCCGAACGAA CGCTGGGATC GCGAGCCGCC GAGTCTGGAG
ACGATGCGCC GGACGGTCGC GATGGCACGG GCGGGATTGC CAGAGACGGT CAGCGTCCAG
GTCCCGCCGA ATCTGGCCCG GACGCGCGAC CTGCTCGACT GCGGCGTCGA CGATCTTGGC
GGTGTCTCCC CGGTCACCGA TGACCACGTC AATCCCGACT ACGCCTGGCC GGCACTGGAC
GAACTCCGCG CGATCGCAGA CGATGCGGGT GTCCCGCTAC GCGAGCGGTT ACCGGTCTAC
GATCGCTACG TGGACGAAGA CTGGCTGAGC GAGCAGGTAT TGGCAACGGT CTCGACAGTG
ACATCGACGG ACGAAACAGG GACCTGA
 
Protein sequence
MIRNPDVIPG AEEYDVDVTI DPAERERLLS VGPEDVAGPG EDGGPDHLSF ARNVFIPLTT 
ACRYTCTYCT YYDPPGQASL LSPEDVREIC REGADAGCTE ALFTFGDDPD DRYDAIYDQL
AEWGHDSIHT YLREACEIAL EEGLLPHANP GDQTREQMAE VADLNASMGV MLETTADLEA
HSGSRRKEPG QRLATIRTAG ELGVPFTTGI LVGIGEDWAD RAESLLAIAA LHERYNHVQE
VIVQPVSPNE RWDREPPSLE TMRRTVAMAR AGLPETVSVQ VPPNLARTRD LLDCGVDDLG
GVSPVTDDHV NPDYAWPALD ELRAIADDAG VPLRERLPVY DRYVDEDWLS EQVLATVSTV
TSTDETGT
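
The gene and protein sequences can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and only the three most common start codons are handled. It uses just the first and last lines of the gene sequence.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common prokaryotic start codons are handled;
# table 11 also permits rarer ones.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as methionine whatever its usual meaning.
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = clean("GTGATCCGAA ACCCGGACGT GATCCCCGGG GCCGAGGAAT ACGACGTCGA CGTCACGATC")
last_line = clean("ACATCGACGG ACGAAACAGG GACCTGA")

print(translate(first_line))                     # MIRNPDVIPGAEEYDVDVTI
print(translate(last_line, is_cds_start=False))  # TSTDETGT
print(round(gc_fraction(first_line), 3))
```

The last line can be translated on its own because the 1080 bases before it are a whole number of codons, so it starts in frame. Passing the full gene sequence through `clean` and `translate` should reproduce the complete protein sequence. The 66% GC content in the record refers to the whole gene, so the value for a single line will differ.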