Gene Hoch_2385 details


Gene Information

Locus tag: Hoch_2385
Symbol:
ID: 8544771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3302310
End bp: 3303938
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 70%
IMG OID: 646387084
Product: protein of unknown function UPF0118
Protein accession: YP_003266815
Protein GI: 262195606
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
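
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm that they agree. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3302310, 3303938)
print(length)                           # 1629, matches "Gene Length"
print(expected_protein_length(length))  # 542, matches "Protein Length"
```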


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.322706
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCA ACGAGCCGCC TTCCCGAACC GAGCCCGCCG AGGCGGGCGT ACCCGAGGAG 
TCGGGCGACA GTCCGTCGAG TGGGCCGTCG AGCGGCGTGC CCAGCACGCC CATGCCGGGC
ACGCCCAGCA GTCCCCACCC CGAGGCCGCG AGCGAGCCCG GGCGCGCGTC CGAAGCCGCG
GGGATGAGCA GCGGCGCGCG CTACGGCGCG CCCAGCGGAC CGCTCGGCGG CGTGCTCAGC
GAGCCCTCGC GCGGCGTCCG CCGGGTGGCC GTGGAGCTGG GAAGCTGGCT CCGGCGGCTC
GCCCGGCTGT GGGGCTTTTT GGCCTTTGCT GTGCTGGTCG TGGTGCTCAC GCGCCACGTG
ATCCTGCCCT TTGTCTTCGC GCTGCTGCTC GCCTTCATCC TCGCGCCCGT GGTGCGCTAC
CTCTCGCACC GCGACGACGG CAGTCAGCGC ATGCCGCGCT GGGTGGCCAT CGTGCTGTGC
TATCTGGTCT TCCTCGGCGT GCTGGTGTCC AGCGCGGCCC TGGTGCTGCC GCGGTTTTCG
GACGACGCGT CGCGTATCGG GGCCAACCTG CCCACGCTCT ACGAGCGCGT CAACGACGAG
TGGGCGCCGG CGTTGGCCGA GTGGCTCGAG AGCAATTTTC CCGCCGACGA GGGCGTCAAA
CCGCTGCACG AGGAGGTGGT CACACCCGGA CAGTCGGTCT ATTTACCGCC CGACACCGCC
TTCGTGCTCA CGCCGCTCGA CGATGGCCGC CTGGCCGTGG AGCTGCCGCC CGGCAGCCTG
TCGATGCTGC CGCGGCCCGA CGGCGGCTTC GAGCTGCGCT CGGGCGAGCC GCCGGCCGAG
AGCGTCGACC TCGAGACCCG CATCCGGCTG TGGACCAGCC AGCGCATCGA GAACCTGCAG
ACCAGCATCG GCGACCTGGT GAGCTTCGGC CAGTCGCTGA TGATCGGTCT GGCGCGCAGC
GTCTTCACCT TCTTCCTGGT GATGATGGTC GGCGCCTTCA TCCTCCTCGA CCTCGAGAAG
CTGCACGGCT TCGCGCGCAG CTTGATCCCC ATGGCCTACC GCGACGACTT CGACCTCATC
GCCAAGGGCA TCAACCGCGG CCTCTCGGGC GTGATCCGCG GCCAGCTCAT CATCTGTCTG
GTCAACGGTG TGCTCACGTA CATCGGCCTG TTCATCTTCA ACGTCGAATA CGCGCTGGTG
CTCGCGCTGG TCGCCGGGAT CATGAGCGTG GTGCCCATCT TCGGGTCGAT CCTGTCGACC
ATCCCCGTCG TCATCGTCGC GCTGTTCTCG GGCGAGGGCG GGCTCGACGT CGCCCGCGCG
CTGGCTGCCT TCGCCTGGAT CGTGGGGATC CACCTGCTCG AGGCCAACGT CCTCGACCCC
AAGATCATGA GCTCGGCCGC GCGCATCCAC CCGGTGCTGG TCATCTTCGC GCTCATCGTC
GGCGAGCACT ACTTCGGCCT CGTCGGCGCG ATCCTGGCGG TGCCCGTGAC CTCGATGGTG
CAGGTGCTGT TCATGTATTT CCGCCGCAAG GCGTGGAAAC TCGACGCCGA GATGGCGCGC
GTGGAGCGCC GCGTCGAACG GCGCCTGAGC CGGCGCTATC GCCGCAGCAG CGACGCCGGA
CCGCGCTGA
 
Protein sequence
MSANEPPSRT EPAEAGVPEE SGDSPSSGPS SGVPSTPMPG TPSSPHPEAA SEPGRASEAA 
GMSSGARYGA PSGPLGGVLS EPSRGVRRVA VELGSWLRRL ARLWGFLAFA VLVVVLTRHV
ILPFVFALLL AFILAPVVRY LSHRDDGSQR MPRWVAIVLC YLVFLGVLVS SAALVLPRFS
DDASRIGANL PTLYERVNDE WAPALAEWLE SNFPADEGVK PLHEEVVTPG QSVYLPPDTA
FVLTPLDDGR LAVELPPGSL SMLPRPDGGF ELRSGEPPAE SVDLETRIRL WTSQRIENLQ
TSIGDLVSFG QSLMIGLARS VFTFFLVMMV GAFILLDLEK LHGFARSLIP MAYRDDFDLI
AKGINRGLSG VIRGQLIICL VNGVLTYIGL FIFNVEYALV LALVAGIMSV VPIFGSILST
IPVVIVALFS GEGGLDVARA LAAFAWIVGI HLLEANVLDP KIMSSAARIH PVLVIFALIV
GEHYFGLVGA ILAVPVTSMV QVLFMYFRRK AWKLDAEMAR VERRVERRLS RRYRRSSDAG
PR
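
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below is one possible way to do that, then compute a GC fraction and translate the gene sequence. It is written under stated assumptions: the helper names are hypothetical, the input is assumed to contain only A, C, G and T, and the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG).

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCGCCA ACGAGCCGCC TTCCCGAACC GAGCCCGCCG AGGCGGGCGT ACCCGAGGAG"
last_line = "CCGCGCTGA"
print(translate(first_line))  # MSANEPPSRTEPAEAGVPEE
print(translate(last_line))   # PR
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the GC content and protein sequence reported in the record.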