Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4084 |
Symbol | |
ID | 5901546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4429676 |
End bp | 4430755 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564604 |
Product | branched-chain amino acid aminotransferase |
Protein accession | YP_001685706 |
Protein GI | 167648043 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01123] branched-chain amino acid aminotransferase, group II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACGA CCAAGCCTCA CGCCAATCCA ACCCCGTCCG AGACCCGCGC GGCGCTGCTG GAAAATCCAG GCTTTGGCAA GGTATTCACC GACCACATGG TGACCGTGCG CTGGACGGCC GAGCGGGGCT GGCACGACGC TGAGGTGCGC GCCAGGGCTC CGTTCTCGCT CGATCCGGCC GCCGCCGTCC TGCACTACGC CCAGGAAATC TTCGAGGGCA TGAAGGCCTA CCGGACGCAG GACGGCGTGG CCCTGTTCCG GCCCGAGGAA AACGCCCGCC GCTTCGCGCG CTCCGCCGCG CGCATGGCCA TGCCGGAAGT GCCCGAGGAC CTGTTCCTGC GAGCGGTCGA GGCGCTGATC CGCGTCGACG CCGACTGGAT CCCCTCCGGC GAGGGCAGCC TCTATCTGCG CCCCTTCATG TTCGCCAGCG AGGCCTTCCT GGGCGTTCGT CCGGCGGCGG AATATATCTT CTGCGTGATC GCCTGCCCGG TCGGGGCTTA TTTCAAGGGC GGCGCCAAGC CGGTGACGGT CTGGGTCTCG CAAGACTACA GCCGGGCCGG GACGGGCGGC ACCGGCGACG CCAAGTGCGG CGGCAACTAC GCCGCCAGCC TGCTGGCCCA AGCCGAGGCC ATGCGCCATG GCTGCGACCA GGTGGTGTTC CTCGACGCGG CCGAGCATCG CTGGGTCGAG GAACTGGGCG GGATGAACGT GTTCTTCGTG CTGGACGACG GCAGCCTGGT CACCCCGCCG CTCGGGGGCA CGATCCTGCC GGGCGTCACC CGCGACTCGG TCATCGCCCT GGCGCGCGAC GCCGGCCTGA CGGTCGCGGA AAAGCCCTAT GCCTTCGACC AATGGCGCGC CGACGCCGCC AGCGGCCGCG TGCGCGAGGT CTTCGCCTGC GGCACGGCCG CCGTGATCGC GGCGATCGGC GAGGTCCGCT TCGGCGATGG CGATTTCAAG ATCAGCGGCG GCGTCGAGGG ACCCGTCACC CGCGACCTGC GCGAACGCCT GACCGGCATC CAGCGCGGAA CCCGCCCTGA CCCGAAGGGT TGGCGGCGTC CCGTCTCGAC GGGCGGCTGA
|
Protein sequence | MFTTKPHANP TPSETRAALL ENPGFGKVFT DHMVTVRWTA ERGWHDAEVR ARAPFSLDPA AAVLHYAQEI FEGMKAYRTQ DGVALFRPEE NARRFARSAA RMAMPEVPED LFLRAVEALI RVDADWIPSG EGSLYLRPFM FASEAFLGVR PAAEYIFCVI ACPVGAYFKG GAKPVTVWVS QDYSRAGTGG TGDAKCGGNY AASLLAQAEA MRHGCDQVVF LDAAEHRWVE ELGGMNVFFV LDDGSLVTPP LGGTILPGVT RDSVIALARD AGLTVAEKPY AFDQWRADAA SGRVREVFAC GTAAVIAAIG EVRFGDGDFK ISGGVEGPVT RDLRERLTGI QRGTRPDPKG WRRPVSTGG
|
| |