Gene Information |
Locus tag | Hoch_3941 |
Symbol | |
ID | 8546337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5435323 |
End bp | 5436534 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646388613 |
Product | hypothetical protein |
Protein accession | YP_003268333 |
Protein GI | 262197124 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
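
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both are assumptions about this record's conventions, though they agree with the values listed. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(5435323, 5436534)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Under these assumptions the computed values match the listed Gene Length (1212 bp) and Protein Length (403 aa).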
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0778882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0100939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG CCGCCCCCCT GCCCTCGGCC GCCGAGATCG ACGCTCTCGC GGCTCGCCTG CGCCCGCACT ACCGGCGCTT CCTCGACGCC TGCGCGGGCG AGGTGCTGCT CACCGCGCAC TCGCACCAGG CCTGGCCCGA TGTCTCGCGC GAGGGCCACA TGGCGGCCTG GGACGACGCC GCGCGCCTGG CCGACCGCAA GTGGTCGCGC ATTCTCGATG AGGTGCTGCC GGCGTTTCGC GAGCGCGTGG CCCAGCGCCT GGGCAGCTCG CGCCCCCGCG ACCTGGCCAT CGCGCCCAAC ACCCACGAGC TGGTGTACCG CCTGGCGAGC TGCTTCCCGC GCGACGCGAC GGTGCTCACC AGCGACGCCG AGTTCCACTC GCTGCGGCGC CAGCTCGTGC GCCTGAGCGA GGACGGCACC AAGGTGGTGA ACGTGGCCAC GGCCGGCGAC GACTTCGGCG CCCGCTTCCT CGCCGCCATC GACGAGCACC GGCCGAGCTG GGTGGCGCTG TCGCAGGTGC TGTTCACGAA CTCGCGCATC GTCACCGAGC TGCCGCGCAT CCTCGCGGCG CTGGCCGCGC GCCAGGTGCC GGCGCTGGTG GACGCGTATC ACGCCTTCAA TGTCGTGCCC ATGGACGTGG ACGCGTGGCC GGGGACGGTG TTCGTGACCG GCGGCGGCTA CAAGTACGCG CAGTCCGGCG AGGGCGCGTG CTGGATGCTG CTGCCGGCGG ATGCCGAGCG CTACCGGCCG CGTCAGACCG GCTGGTTCGC CGACTTCGCG CATCTCGAGG AGGGCGCCAG CGCGGTCGAG TACGGGCCCG GCGGGCAGCG CTTCTTCGGC TCGACCTTCG ACGCCGCGGG CATCTACCGC GGGCTCTACG TGCTGCGCTG GATGGACGAG ATGGGGCTGA CGCCGAGCGT GCTCGCGGCC CACGCGCAGG CGCGTACCCA GCGCATTGTC GACGCCTTCG ACCGCCTGGC GCTGGAGCGC GCCGGGCTGC GCCTGGCCTC GCCGCGCGAG CCCGAGCGCC GCGGCGGCTT CGTGGCGATC GCGAGCGAGG GCGCCAGCGC GCTGGCCGCG GCCCTGGCCG AGGCCGGCGT GCGCAGCGAC GTGCGCGGCC ATCTGCTGCG CCTGGGCCCG GCTCCGTACC TCGACTGCGG CGACATCGAT CGCGCCATGG ACGCGCTGGC CGCGGCCGCC GCGCGCGGCT GA
|
Protein sequence | MTQAAPLPSA AEIDALAARL RPHYRRFLDA CAGEVLLTAH SHQAWPDVSR EGHMAAWDDA ARLADRKWSR ILDEVLPAFR ERVAQRLGSS RPRDLAIAPN THELVYRLAS CFPRDATVLT SDAEFHSLRR QLVRLSEDGT KVVNVATAGD DFGARFLAAI DEHRPSWVAL SQVLFTNSRI VTELPRILAA LAARQVPALV DAYHAFNVVP MDVDAWPGTV FVTGGGYKYA QSGEGACWML LPADAERYRP RQTGWFADFA HLEEGASAVE YGPGGQRFFG STFDAAGIYR GLYVLRWMDE MGLTPSVLAA HAQARTQRIV DAFDRLALER AGLRLASPRE PERRGGFVAI ASEGASALAA ALAEAGVRSD VRGHLLRLGP APYLDCGDID RAMDALAAAA ARG
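
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below shows one way to strip that formatting, compute a GC fraction, and translate codons. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in their permitted start codons). The helper names are hypothetical, and only the first few blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed
# here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGACCCAGG CCGCCCCCCT G")
print(gc_fraction(fragment))
print(translate(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `translate` would allow the listed GC content and Protein sequence to be compared against recomputed values.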