Gene Hoch_2186 details

Gene Information

Locus tag: Hoch_2186
Symbol:
ID: 8544572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3046148
End bp: 3047317
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 66%
IMG OID: 646386893
Product: UBA/THIF-type NAD/FAD binding protein
Protein accession: YP_003266624
Protein GI: 262195415
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
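
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3046148
end_bp = 3047317

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per three bases, with the final codon being a stop
# codon that does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1170 389
```

Under these assumptions the computed values agree with the listed gene length (1170 bp) and protein length (389 aa).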


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0893637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACGT TTCAGAACTA TCTCAAGGGA ATCAAGAACG AGATCAAGGA GATCGACGCC 
GCAGGCATGA AGGAGGTCCT CGACAAGGGG GATTCCCATG TCGCCGTGAT CGATGTCCGC
GAGAAGGACG AGTACGTCCA GGGGTACATC CCCGGGGCGA CATGGATCCC GCGTGGCTTT
CTCGAGCTGC GCATCGAGGA GGCGGTGCCC GAGCGTGACC AGCCGGTGGT GCTGTACTGC
GCCGGCGGCA CGCGTTCGGC CCTGGCCGCG CGCGCGCTGC ACGAGCTCGG TTACACCGAC
GTCACCTCGA TGGCCGGCGG CTTCACTTCG TGGAAGCGCA ACGGCAACCG CTTCGAGATG
CCGGTGGTGA TGACCCCGGA GCAGGAGATG CGCTACGCGC GTCACACCAT GCTGCCCGAG
GTCGGGGTCA AGGGGCAGGT CAAGCTGCTC GAGTCCAAGG TGCTGTGTCT GGGCGCCGGC
GGTCTGGGAT CGCCCTCGGC CCTGTACCTG GCCGCGGCCG GCGTCGGCAC CATCGGCATC
GTCGATGACG ACGTGGTCGA CGCCTCGAAC CTGCAGCGCC AGATCCTGCA CGCCACCGAT
CGCGTCGGCA TGTCCAAGGT GGACAGCGCC GAGAAGACGT TGAACGGGCT CAACCCCGAC
GTCCAGATCA AGAAGTTCCA GGAGCGGCTG ACCTCGGACA ACGTGCTCGA GATCATCAAG
GACTTCGACG TCATCGTCGA CGGCGCCGAC AACTTCCAGA CCCGCTACCT GCTCAACGAC
GCCGCGCTCA AGCTGGGCAA GCACGTCGTG CACTCGTCGA TCTACCGCTT CGAAGGCCAG
CTCACCGTGT TCACGGGCGA CGGCCACCCG TGCTACCGCT GCCTGTATCC CGAGCCGCCG
CCGCCCGAGG AGGCACCCTC GTGCCAGGAG GCCGGCGTGC TCGGTGTGCT GCCCGGCATC
ATGGGCGTTC TGCAGGCCAC CGAGGCGGTC AAGCTCATCC TCGGCATCGG CACCAGCCTG
TCGGGTCGCC TGCTGGTCTT CGACGCGCTC AAGACCAAGT TTCGCGAGCT CAAGCTGCGC
CAGGACCCCA ACTGCCCGAC CTGCGGGGAG GGCGTCGATC GCTCGCAGAT CGAGCTCATC
GACTACGTGC AGTTCTGCGC CGGCGCCTGA
 
Protein sequence
MPTFQNYLKG IKNEIKEIDA AGMKEVLDKG DSHVAVIDVR EKDEYVQGYI PGATWIPRGF 
LELRIEEAVP ERDQPVVLYC AGGTRSALAA RALHELGYTD VTSMAGGFTS WKRNGNRFEM
PVVMTPEQEM RYARHTMLPE VGVKGQVKLL ESKVLCLGAG GLGSPSALYL AAAGVGTIGI
VDDDVVDASN LQRQILHATD RVGMSKVDSA EKTLNGLNPD VQIKKFQERL TSDNVLEIIK
DFDVIVDGAD NFQTRYLLND AALKLGKHVV HSSIYRFEGQ LTVFTGDGHP CYRCLYPEPP
PPEEAPSCQE AGVLGVLPGI MGVLQATEAV KLILGIGTSL SGRLLVFDAL KTKFRELKLR
QDPNCPTCGE GVDRSQIELI DYVQFCAGA
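
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last 30 bases of the gene. This is a minimal sketch, not the pipeline that produced the record: the `translate` helper and the compact codon table are written here for illustration. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon-to-amino-acid mapping is used.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_30 = "ATGCCCACGT TTCAGAACTA TCTCAAGGGA"
last_30 = "GACTACGTGC AGTTCTGCGC CGGCGCCTGA"

print(translate(first_30))  # MPTFQNYLKG
print(translate(last_30))   # DYVQFCAGA
```

The two results match the start and the end of the protein sequence listed above; the final TGA codon is a stop codon and yields no residue.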