Gene Hoch_6441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6441 
Symbol 
ID8548856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp8839414 
End bp8840982 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content73% 
IMG OID646391102 
Producthypothetical protein 
Protein accessionYP_003270803 
Protein GI262199594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCC AAGATCACCG GAAGGCGCCG CTCGATAGCA GAGCGCGCGC GCGGAAGCCG 
AGCTGGAACG TGCCGCCGCA GTTGACCGCG TTGGCGCTGG TCGCATGGGT GGCGCTCGCT
GGCGGCTGCA GGGACAGGGA CAGGGACAGG GACAGGGACG CACCGCCGCA GTCCGAGAAG
GTACAGGCGT CCGAGAACGC CGCAGCGATG CCGTCCGCCC CTGAGAATAC CGCGGCCGCG
GCACAAGCAC AGGCCGCAGC CGCGGCATCG GCCCAGGCCG CGGCCGACGA CCTCGATGCG
CGCATCGAGA CCGCGCTGCT GCGCGGCGCC TCGTATCTGG TCTCGCGCCA GGCCGCCGAC
GGCGCGTTCC GCTCGACCGA GGACGAGGCC CTGAGCAACG GGTACTCGCT GACCCCGCCC
GTGGTGCTGG CGCTGTGGGG GGCGACCCGG GGCGGCCAGG GCGCGACCGC GACCGCGAGC
GTGGCCGACA AAGCGCGCGC GCGCGGGGCC GGGTTTCTGG CCACCATCAT CACGCCCGAG
GGCGTGATTC GCACCGGCGA CGATGCGCCG CGCTTTCCCT ATCATGCCGC CGCGGGCGCG
CTGCTGGTGT TGAGTATGCA CGGCGCCCCG GAGCACGCCG AGGTGCGCGG CGCGCTGGTC
GCGTACCTGC TCGGCCTGCA GCTCACCGAG ACCCACGGCT ACCCGAGCAA TCACCCCTCG
TACGGCGGCT GGGGCTACAA CACGGCCATC CCCGAGTTCG AGGACGGGCA CGCCGAAGTG
CAGGTGAGCG CCAACCTGGC GATCACCGAG TACATCGTCG GCGCGCTGCG CATGAGCGGC
GTCGCGGCCG ACCATCCCGC CATGCAGAAG GCCGAGGTCT TCGTCTCGCG CTGTCAGAAC
TTCACCTCGC TGCCCGAGGG CGAGGGCGCG CCGGGCACCC CGGAGGTGAC CACGACCGAC
TGGGACGACG GCGGCTTCTT CCAGTCGCGC TCCGGACCCG ACAGCAACCT CGCCGGTGTC
GGCGGCATGG ACGTGCTCGG CCGCACGCGC TATCGCTCCT ACGGCTCGGC CACGGCCGAC
GGCCTGCGCA CGCTGCTGCA GCTCGGCTTC GGCGGCGACG ACCCGCGGGT CGCGGCCGCC
ACGCGCTGGC TGCTGCGCAA CTTCTCGGCC GTGCGCAACG CGGGCGCGTT TCCGCCCGAG
CTCGAAGTCC GGCGCGCGGC CTATCACTAC TACTACGCTC ACTCCGTGGC CCACGCCATG
CGCGCCATCG GCGCCGCCGA GATCGATATC GCCAGCGCCG AGCTGGTCGC CGCCGAGCAG
CCGCCCACCG ATGCCGAGAC CGCGGCCGAG GAGCCCGCCG CGACCCGGCG CGTGCACTGG
GCCGAGGCGC TGGCCGAAGC CCTGCTGGCG CGCCAGGACA AGGACGGCGC GTGGCGCAAC
CAGCACATCG AGGCGCGCGA GGACGACCCG CTGGTGGCCA CGCCCTACGC GCTCTCGGCC
CTGGCCATCG CGCGCATGGC GCTCACGGGC GAATCTCGGA CACACCGTCC AGATCAGGTG
CGGCTGTGA
 
Protein sequence
MKRQDHRKAP LDSRARARKP SWNVPPQLTA LALVAWVALA GGCRDRDRDR DRDAPPQSEK 
VQASENAAAM PSAPENTAAA AQAQAAAAAS AQAAADDLDA RIETALLRGA SYLVSRQAAD
GAFRSTEDEA LSNGYSLTPP VVLALWGATR GGQGATATAS VADKARARGA GFLATIITPE
GVIRTGDDAP RFPYHAAAGA LLVLSMHGAP EHAEVRGALV AYLLGLQLTE THGYPSNHPS
YGGWGYNTAI PEFEDGHAEV QVSANLAITE YIVGALRMSG VAADHPAMQK AEVFVSRCQN
FTSLPEGEGA PGTPEVTTTD WDDGGFFQSR SGPDSNLAGV GGMDVLGRTR YRSYGSATAD
GLRTLLQLGF GGDDPRVAAA TRWLLRNFSA VRNAGAFPPE LEVRRAAYHY YYAHSVAHAM
RAIGAAEIDI ASAELVAAEQ PPTDAETAAE EPAATRRVHW AEALAEALLA RQDKDGAWRN
QHIEAREDDP LVATPYALSA LAIARMALTG ESRTHRPDQV RL