Gene Information |
Locus tag | Mmar10_0412 |
Symbol | |
ID | 4285546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 493089 |
End bp | 493889 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638139875 |
Product | branched chain amino acid: 2-keto-4-methylthiobutyrate aminotransferase |
Protein accession | YP_755643 |
Protein GI | 114568963 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | |
|
|
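The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length counts the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 493089
END_BP = 493889


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (801 bp) and Protein Length (266 aa) fields.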
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGTA TCGCCTTCAA GGACGGGGAC TGGCTGGAAA CCGGGCAGAC CGGATGGGCG CTGGCGGACC GCGGCGTCCT GCTGGGCGAT GGCCTGTTCG AAACCCTGCA TGTGATCAGG GGCAAGGTCG TCCGGCTCGA CCGCCACATG GCGCGACTGA CCCGCAGTGC GGCCGAGCTG GGCCTGCCGG GGCCACGTGA CGGTGACAGT ATCGCCGAGC TGGTCGCTGA ACTGGTCGCC CGCAATGCCC TGAAGGACGC CATTGTGCGC CTGACCCTCA CGGCGGGACC CGGGCTGCGC GGGTTGGAGC GACCGGAGGA GCTGGTTCCC TCACTGACCC TGACCGCCGC ACCGCGGCTG GCGCCGCCGG CCTCGATCCG TCTGGCGCTC AGTGAAGTCC GGCGCTCGCC GGCCAGCCTC GCCGCGCGTC ACAAGACACT CTCTTACATG GACAACATCC AGGCCCGGCG GCAGGCGCGC GGGCAGGGGG CCGACATGGC CTTGCTGCTG GATACGCGCG GCAATGTGTC CGGGTGTGAT TGCGCCAATG TGTTCTGGCT CATTGGCGGC GAGGTCTACA CGCCGGCTAC CGCCTGCGGC GTGCTGGCCG GAACCGTGCG GGCCGAGATC GTGGACTCGA TGCCGGTCGA GACCGGCGCC TTCGGGCTGG ATGTGCTGGA GGGTGCCGAA GCGGTGTTCG TCACCAATGC CGCTTTCGGC GCGGTGCCGG TGACCGAGCT GGACGGGCGA CCGCTGGGAT CCGGTGAGTT GCCGGCCCGG ATCCGGGCTC TCTTCGCCTA G
|
Protein sequence | MTGIAFKDGD WLETGQTGWA LADRGVLLGD GLFETLHVIR GKVVRLDRHM ARLTRSAAEL GLPGPRDGDS IAELVAELVA RNALKDAIVR LTLTAGPGLR GLERPEELVP SLTLTAAPRL APPASIRLAL SEVRRSPASL AARHKTLSYM DNIQARRQAR GQGADMALLL DTRGNVSGCD CANVFWLIGG EVYTPATACG VLAGTVRAEI VDSMPVETGA FGLDVLEGAE AVFVTNAAFG AVPVTELDGR PLGSGELPAR IRALFA
|
| |
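The relationship between the gene sequence and the protein sequence can be checked by translating the CDS. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons) and translates a sequence after removing the spacing used in the record. The `translate` helper is a hypothetical name, and only the first 30 and last 21 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a forward-strand CDS; stops at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence in the record.
head = "ATGACGGGTA TCGCCTTCAA GGACGGGGAC"
tail = "ATCCGGGCTC TCTTCGCCTA G"
print(translate(head))  # start of the protein
print(translate(tail))  # end of the protein, stop codon dropped
```

Passing the full Gene sequence field to `translate` in the same way allows a comparison against the complete Protein sequence field.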