Gene Hoch_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2271 
Symbol 
ID8544657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3161432 
End bp3162589 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content72% 
IMG OID646386976 
Producthypothetical protein 
Protein accessionYP_003266707 
Protein GI262195498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0830218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTC CCAAGACCCT CTCGCGCTTG TCACAGGCCC CGCAGGATCC CGAGGAATTG 
CGACGCGTTG CTCGCACCCT GGCCAGTACC TTGCACAGTA AGCGCGGCAC GCAGACGAAC
GGCGCGCGCG CGTACCTGGC GCAGCACGCC ACCAAACACG GCGATTTCGC GGCCGGCTAC
AGCGCCAGCC TGTGTGATAT CGCGGTCGAA CACAGTCGCG CCGCGGCCGA AGACGACGAC
ATCGAATGCG CGCGCCGGCT GCTCGTCGAC CGGCAGATGC GCGAGCTGCT GGCGCGTCTG
GCCGTCGAGC CAGCGCGGCC GAGGGCGCTG GCCGAGGTCG TGGAAGAGAG CACGGCCCAG
GTCGCCGAGC GCCTCGATCA TCTCGCTGCC GTGGGGCTGG TGCAGGCCTA CGCCGCGGGC
ACGGACGAGC GCCACATGGC CGTGTATCGG GCCACGCGCA CGGGCCGACG CTTGCTCGAC
GAGCTCGGCC CGAACCTGAG CACGCCCATC GAGCAGGGCA TCCGCCTGGC CGTCTCGCTT
TTCGACTACC TCGCGCAGCA CCAGCTCAGC CCGGCCTCGG CGCTGCACGA GATCGCCGAA
GAGCTGCTCC ACGATCCCGC CGCCGCGGTC GCCGCCGTCC GCGCCTGGGC CGAGGCCGCG
AGCGAGCGCG GCCTGGTCGA TGAATTCGGC AGCGCGCCCC TGGCCGAGGG CACCGGCGCC
AAGCGCGCGC CCGGCTACCG CGCCAGCACC AGCGCGGCCG GCGAGCTGCG CTCCGCGCAT
CTGTGGCGCG AGGCCCCGGC CCTGCTCGAG CAGCTCGGCA GCGAACGCGC CGCGCCCGTG
TACGTGCGCA CCGATCCCGC CGGCTGGAGC GCCTGGGCCT TCGCCCTCAA CAGCCGCGAC
CACAGCGGCC GCTCGCGCAC CATCGTCGAC GGCGACATCC TCGCGCAGTC CGTAAGCCCG
CCCGAGCACG GCTTCGACCT GGTCTACGAC CGCCGCGACA CCCTCGACAG CGACAGCCGC
GAGCCGACCA TGCGCGCCTT TCTCGAGCGC GCCGAGCAGC GCTTTCTCAT CGCGGCCGAT
GACGAGGACG TCCCCGAGGG CTTCATCCGC CTGGCGCCGC CGCCGCCCGA CAGCGACAGC
GACAACGCGC CGAGTTGA
 
Protein sequence
MNVPKTLSRL SQAPQDPEEL RRVARTLAST LHSKRGTQTN GARAYLAQHA TKHGDFAAGY 
SASLCDIAVE HSRAAAEDDD IECARRLLVD RQMRELLARL AVEPARPRAL AEVVEESTAQ
VAERLDHLAA VGLVQAYAAG TDERHMAVYR ATRTGRRLLD ELGPNLSTPI EQGIRLAVSL
FDYLAQHQLS PASALHEIAE ELLHDPAAAV AAVRAWAEAA SERGLVDEFG SAPLAEGTGA
KRAPGYRAST SAAGELRSAH LWREAPALLE QLGSERAAPV YVRTDPAGWS AWAFALNSRD
HSGRSRTIVD GDILAQSVSP PEHGFDLVYD RRDTLDSDSR EPTMRAFLER AEQRFLIAAD
DEDVPEGFIR LAPPPPDSDS DNAPS