Gene Hoch_1036 details


Gene Information

Locus tag: Hoch_1036
Symbol:
ID: 8543418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1322383
End bp: 1323966
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 65%
IMG OID: 646385789
Product: SEFIR domain protein
Protein accession: YP_003265524
Protein GI: 262194315
COG category:
COG ID:
TIGRFAM ID:
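
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon, and the variable names are hypothetical rather than part of the record.

```python
# Values copied from the record above.
start_bp = 1322383
end_bp = 1323966

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1584 bp) and Protein Length (527 aa).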


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCGA CCATGCCCCA GTCCGAGCCG GAGAAGGCAT CAGCGTCGGC CCCTCGAGTT 
TTCCTCAGCT ACAGTCACGA CTCCCCCGAA CATCGTGATC GCGTCCTGGA CTTGGCTCAG
CGCCTACGGC GAGAGGGCAT AGACGCGTGG CTGGATCGCT TCACGCCGCA TCCGCCCGAG
GGCTGGCCGC GCTGGATGCA GCGCCAACTC GAGCAGGCCG ACCACGTGCT CGTGGTGTGC
ACCGAAACCT TTTGTCTCCG GTTCAACGGT CACGAGGAAC CGGACCGAGG CCTCGGTGCG
ACGTGGGAAG GGTTTTTGGC CACCCAGGTG CTCTACGAGA GCGGAACGCG CAACGACAAA
CTGATTCCGG TGCTCATGGA AGGCGCCCGG CAAAGCGATA TCCCTCTCGC GCTACGAGCC
TACACGCATT ACCGAGTGCC TGGCGGCTAT GATCAGTTAT ATCGGCAGAT AACGCAGCAA
CCCGAGGTCG TGCCCGCAAA CTTGGGTGAG GTGCGCGAGA TGCCGCAGCA GCAGACGCGA
GCAGCAGACG CGATACATGC TCGTGGCACG GAGCCGGGCG CGGTGCCTGA GATGTCGCCG
CTGTCTGAAG GCGTAGCGGA GACCCCGGGC GGACCAATCA GCCCGTTTTT GCCGGGCGTT
ATCGCCAAGC GAGCCGAAGA TTTTTTCGGA CGGCAACGCG CGCTGGACGA GATCACTCAT
TCGATTCACC ATCGCCAGCC CATTCAGATC CTAGGAGAGG CGCGCATGGG CAAGTCGTCG
CTGCTCGCCC ACGTGGCACG TACCCTCGTG CCCGGCGACA TGCAGGTGGC CGAAGTGCAC
GCCCGCGGCC GCTCGGGTTG GTCGCCACGC GAATTGATCT TGGCCATGGC CGACGCGCTC
GGTCAACGAG CGGTCGTCGA CAGCGTACTG CGGGCTGCGC CGGCGACGGT CGATGAGGCA
CCGGCGGCCG TAAACGCCTT GCAGTTCCTC CTTCCGTGCG CGCTGCTGCT GGACGACGCT
GACGCCATCG CCGACGCGGG GCACCACTTC GACCGCGCGT TTCTGGACGA ATGCCGCGCT
TTGACCCAGT CTCGCCGCCT GCTGTGGATT TCGGCCTCGC GCCGAGATTT GCAGAGCTGC
TTCCGCGAGA CCGGCCTCAG CTCCGAGCTT CTCAACAACT CGCGACGCGT TGTTATCGGT
CAGCTGGACA AAAAAGAGAC TGAACAGCTG CTGGGCGTGC TAGGCGCCTC CATGGATGAA
CGTTGCTACC GCCAGGCCGG CGGCTGGCCT GATGGTCTGC AATGGCTGGG GGACCGACTC
TGGCGCGACG GCGAGCGAGC GTCCACAGAC GATGATTTCG CTAACGCCAT GGAGCAAACC
TTCCGAAGTT GGTGGAAGTT GCGCACGCAG GCCGAGCACG CGCTTCTGCG GCGCCTGGTT
TTGCCCACCC CGATTACCGG ACTGTCGGAT AGCGAACGGA GGCGGGCGCG CAAGCTCGTG
TCTCGCGGTC TGCTCTGCGA GCGAGATGGC GCGTTCGCGC TGCTCGGCGC GGCCTGGGCG
AACTGGGTGC GCGATGTCGA GTGA
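
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. As a minimal sketch of how a GC content figure could be derived from a sequence in this layout, the helper below (`gc_content` is a hypothetical name, not a tool used by the database) counts G and C bases over the total length. It is applied here only to the first line of the sequence; passing the full sequence would give the gene-wide value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, in its 10-base block layout.
first_line = "ATGATCCCGA CCATGCCCCA GTCCGAGCCG GAGAAGGCAT CAGCGTCGGC CCCTCGAGTT"
print(f"{gc_content(first_line):.0%}")
```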
 
Protein sequence
MIPTMPQSEP EKASASAPRV FLSYSHDSPE HRDRVLDLAQ RLRREGIDAW LDRFTPHPPE 
GWPRWMQRQL EQADHVLVVC TETFCLRFNG HEEPDRGLGA TWEGFLATQV LYESGTRNDK
LIPVLMEGAR QSDIPLALRA YTHYRVPGGY DQLYRQITQQ PEVVPANLGE VREMPQQQTR
AADAIHARGT EPGAVPEMSP LSEGVAETPG GPISPFLPGV IAKRAEDFFG RQRALDEITH
SIHHRQPIQI LGEARMGKSS LLAHVARTLV PGDMQVAEVH ARGRSGWSPR ELILAMADAL
GQRAVVDSVL RAAPATVDEA PAAVNALQFL LPCALLLDDA DAIADAGHHF DRAFLDECRA
LTQSRRLLWI SASRRDLQSC FRETGLSSEL LNNSRRVVIG QLDKKETEQL LGVLGASMDE
RCYRQAGGWP DGLQWLGDRL WRDGERASTD DDFANAMEQT FRSWWKLRTQ AEHALLRRLV
LPTPITGLSD SERRRARKLV SRGLLCERDG AFALLGAAWA NWVRDVE