Gene Information |
Locus tag | Hoch_3947 |
Symbol | |
ID | 8546343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 5444793 |
End bp | 5446019 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646388619 |
Product | carbamoyl-phosphate synthase, small subunit |
Protein accession | YP_003268339 |
Protein GI | 262197130 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
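
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5444793
end_bp = 5446019
gene_length_bp = 1227
protein_length_aa = 408

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not counted
# in the protein length, so protein length = codons - 1.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 5446019 - 5444793 + 1 = 1227 bp, and 1227 / 3 - 1 = 408 aa.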
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.201135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00383855 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCACG AGGAAGACGC GCGCGAGGAC GCGCTGCTGC TGCTCGAGAG CGGCCGGGTG TTCCGCGGCC GGGCCCTGGG CGCGCGCGGC AAGGTGTTTG GCGAGGCGGT GTTCAACACC TCGATGACCG GCTACCAGGA GATCGTCACC GATCCCTCGT ATTCCGGCCA GATCGTGGTC ATGACCGCGC CCCAGATCGG CAACACCGGC ATCAACCGCG AGGATATCGA GTCGGCCAAG CCCATGTGCG CCGGCTTCGC GGTGCGCGAG GCCTCGCCGC TGGCCTCGAG CTGGCGCGCC TCGGGCGAGC TCGACGAGTA CCTGCGCGCC AGCGGCATCG TCGGCATCGA GGGCATCGAC ACCCGCGCGC TCACCCGCGC CCTGCGCACG GCCGGCGCCC AGCGCGCGGT GGTGGCCAGC GGCGCTCACG ACGACGAGGC CGTGGCCGCG CTGCTCGCCG AGGTGCGCGA GGCGCCGCAC ATGAACGGCC TCGACCTGGC CTCGCGCGTG ACCTGTGCCG AGCGCTACCA GTGGCCGCCG AGCGCGCCCG AGGCCGCCGA GGCCGCGCGC CAGCTCAGCG AGCGCTGGCT GCCCGCGAGC ACCCGCGTAG GCGCGGACAC CCGCGAGGAC GCGGGCGCGC GCTTTCACGT GGTCGCCTAC GACTTCGGCA TGAAGCACAA CATCCTCGCC TGCCTGGCGC GGCTCGGCTG CCGCGTCACC GTGGTGCCGG CCAGCACCTC GGCCGCCGAC GCGCTGGCGC TCGCGCCCGA CGGCGTGTTC CTGTCCAACG GCCCCGGCGA CCCGGCCGCG GTCGGCTACG CGGTCGAGGC CGTGGCCGAG CTGGCGGCCT CGGGCAAGCC GCTCTTCGGC ATCTGCCTGG GCCACCAGAT CCTGGCCCTG GCGCTGGGCG CGAGCACCTA CAAGCTGCCC TTTGGCCACC ACGGCGGCAA CCACCCGGTG CAGGACCTGG CCAGCAAGCG CATCGAGATC TCGGCCCATA ACCACGGCTT CGCCGTCGCC GAAGACTCGC TGCCCGAGAC CGTGCGCTGC ACCCACCGCA ACCTCTACGA CGACACCGTC GAGGGCATCG AGCTGGTCGG CGCGCCCGTG TTCGGCATCC AGTACCATCC CGAGGCCAGC CCCGGGCCGC ACGACGCGCT GGGCGTTTTC GACCGCTTCG TCGACGCCAT GGCCAGCGCG CAGGGCGAGG GCGGTACGAC GACGTGA
|
Protein sequence | MSHEEDARED ALLLLESGRV FRGRALGARG KVFGEAVFNT SMTGYQEIVT DPSYSGQIVV MTAPQIGNTG INREDIESAK PMCAGFAVRE ASPLASSWRA SGELDEYLRA SGIVGIEGID TRALTRALRT AGAQRAVVAS GAHDDEAVAA LLAEVREAPH MNGLDLASRV TCAERYQWPP SAPEAAEAAR QLSERWLPAS TRVGADTRED AGARFHVVAY DFGMKHNILA CLARLGCRVT VVPASTSAAD ALALAPDGVF LSNGPGDPAA VGYAVEAVAE LAASGKPLFG ICLGHQILAL ALGASTYKLP FGHHGGNHPV QDLASKRIEI SAHNHGFAVA EDSLPETVRC THRNLYDDTV EGIELVGAPV FGIQYHPEAS PGPHDALGVF DRFVDAMASA QGEGGTTT
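
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any length or GC content calculation. The sketch below shows one possible way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Demo input: the first two 10-base blocks of the gene sequence in this record.
demo = clean_sequence("ATGAGTCACG AGGAAGACGC")
print(len(demo), gc_fraction(demo))
```

Passing the full Gene sequence field to the same helpers would give values that can be compared with the Gene Length and GC content fields listed in the record.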