Gene Hoch_3941 details


Gene Information

Locus tag: Hoch_3941
Symbol:
ID: 8546337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5435323
End bp: 5436534
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 75%
IMG OID: 646388613
Product: hypothetical protein
Protein accession: YP_003268333
Protein GI: 262197124
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID:
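
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5435323, 5436534)
print(length, protein_length_aa(length))  # 1212 403
```

Under these assumptions the computed values match the Gene Length (1212 bp) and Protein Length (403 aa) fields of the record.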


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0778882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0100939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGG CCGCCCCCCT GCCCTCGGCC GCCGAGATCG ACGCTCTCGC GGCTCGCCTG 
CGCCCGCACT ACCGGCGCTT CCTCGACGCC TGCGCGGGCG AGGTGCTGCT CACCGCGCAC
TCGCACCAGG CCTGGCCCGA TGTCTCGCGC GAGGGCCACA TGGCGGCCTG GGACGACGCC
GCGCGCCTGG CCGACCGCAA GTGGTCGCGC ATTCTCGATG AGGTGCTGCC GGCGTTTCGC
GAGCGCGTGG CCCAGCGCCT GGGCAGCTCG CGCCCCCGCG ACCTGGCCAT CGCGCCCAAC
ACCCACGAGC TGGTGTACCG CCTGGCGAGC TGCTTCCCGC GCGACGCGAC GGTGCTCACC
AGCGACGCCG AGTTCCACTC GCTGCGGCGC CAGCTCGTGC GCCTGAGCGA GGACGGCACC
AAGGTGGTGA ACGTGGCCAC GGCCGGCGAC GACTTCGGCG CCCGCTTCCT CGCCGCCATC
GACGAGCACC GGCCGAGCTG GGTGGCGCTG TCGCAGGTGC TGTTCACGAA CTCGCGCATC
GTCACCGAGC TGCCGCGCAT CCTCGCGGCG CTGGCCGCGC GCCAGGTGCC GGCGCTGGTG
GACGCGTATC ACGCCTTCAA TGTCGTGCCC ATGGACGTGG ACGCGTGGCC GGGGACGGTG
TTCGTGACCG GCGGCGGCTA CAAGTACGCG CAGTCCGGCG AGGGCGCGTG CTGGATGCTG
CTGCCGGCGG ATGCCGAGCG CTACCGGCCG CGTCAGACCG GCTGGTTCGC CGACTTCGCG
CATCTCGAGG AGGGCGCCAG CGCGGTCGAG TACGGGCCCG GCGGGCAGCG CTTCTTCGGC
TCGACCTTCG ACGCCGCGGG CATCTACCGC GGGCTCTACG TGCTGCGCTG GATGGACGAG
ATGGGGCTGA CGCCGAGCGT GCTCGCGGCC CACGCGCAGG CGCGTACCCA GCGCATTGTC
GACGCCTTCG ACCGCCTGGC GCTGGAGCGC GCCGGGCTGC GCCTGGCCTC GCCGCGCGAG
CCCGAGCGCC GCGGCGGCTT CGTGGCGATC GCGAGCGAGG GCGCCAGCGC GCTGGCCGCG
GCCCTGGCCG AGGCCGGCGT GCGCAGCGAC GTGCGCGGCC ATCTGCTGCG CCTGGGCCCG
GCTCCGTACC TCGACTGCGG CGACATCGAT CGCGCCATGG ACGCGCTGGC CGCGGCCGCC
GCGCGCGGCT GA
 
Protein sequence
MTQAAPLPSA AEIDALAARL RPHYRRFLDA CAGEVLLTAH SHQAWPDVSR EGHMAAWDDA 
ARLADRKWSR ILDEVLPAFR ERVAQRLGSS RPRDLAIAPN THELVYRLAS CFPRDATVLT
SDAEFHSLRR QLVRLSEDGT KVVNVATAGD DFGARFLAAI DEHRPSWVAL SQVLFTNSRI
VTELPRILAA LAARQVPALV DAYHAFNVVP MDVDAWPGTV FVTGGGYKYA QSGEGACWML
LPADAERYRP RQTGWFADFA HLEEGASAVE YGPGGQRFFG STFDAAGIYR GLYVLRWMDE
MGLTPSVLAA HAQARTQRIV DAFDRLALER AGLRLASPRE PERRGGFVAI ASEGASALAA
ALAEAGVRSD VRGHLLRLGP APYLDCGDID RAMDALAAAA ARG
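
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned, how a GC fraction could be computed, and how the nucleotide sequence could be translated. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, which this sketch ignores). All function names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
excerpt = clean_sequence("ATGACCCAGG CCGCCCCCCT GCCCTCGGCC")
print(translate(excerpt))  # MTQAAPLPSA
```

The translated excerpt agrees with the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.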