Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3654 |
Symbol | |
ID | 5901109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3943232 |
End bp | 3944353 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564165 |
Product | aminotransferase class I and II |
Protein accession | YP_001685279 |
Protein GI | 167647616 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.817087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.196766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGTTG TCGATCAAAA GTCCCTGGCG CCCGCCGTGG TTTCGCCCGT GCGTCCGGCC CTGGAAGGCG TCACCGCCTA CAAGGCCGGC ATGACCCTGG CGCAGGCTGG TCGCCTGGCC GGTCGCGTCG ACCTGTCCAA GCTGGCCAGC AACGAGAACC TGCTGGGCTG CAGCCCCAAG GTCGCGGACG CCGTGCAGGC CGCCCTGGCC AGCCCCGAAA TCTATCCCGA CCCCTATTGC GAGACCCTGC GCGCCGCGAT CGGCGAGCGG CTGAAGGTCG ACCCGGCCCG CATCGTCATG ACCCCGGGCT CCGAGGCCCT GATCGACTAC CTGTTCCGCG CCGTGCTGCA CCCGGGCGAC AGCATCCTGC TGAGCTCGCC CACCTTCCCG GCCTACGAGA TCTTCGGCCG CTGCGCGGAG GCAAGCTTCA TCGACGTGCC GCGCCTGGGC AATTTCGACC TTGATGTCGA AGCCTTCAAG GTCGAGGCCG CCAAGGGACC CAAGCTGCTG GTGCTGTGCA CCCCCAACAA CCCGACCGGC AACGCCCTCA GCGGCGTCGA TATCGGCGAG ATCCTGGCCG TCACCCCGCG CTCGACCGTG GTGTTCATCG ATGAGGCCTA TCGCGAATAT CACGAGGACT TCGACACCCT GGCTCTGCTG AAGGCCTGGG GCGGGACCTG GGTCTCGGGC CGCACCTTCT CCAAGGCCTA TGGCCTGGCG GGGATGCGCG TGGGCTACGG GATCTGCTCG TCAGCCGAAT TGGTCGGCTA TCTGGACCGC ATCCGCCCGC CGTTCAACGT CACCGCCATC AGCCAGGCCG CCGCCCTGGC CGCCTGGAAC GATCGGGAAC ACCTGGAGCG CACCGTGGCC CTGACGCTCA CAGAGCGGGC GAGGGTCGAG GCCGTGCTGG ACGCCATGGG TGTCGAGCGC ACCAAGAGCG CCGCCAACTT CGCGTTCCTG CGGGCCAAGG TCGGCGCCGA TGCTGTCGCC ATGCGTTTGC TGAAAGAGGG GCTGATCGTG CGCCCCACCC CGGTGCCCGG CGGCTGGGTG CGGATCACCA TCGGCCGTCC CGCCGACAAC GACCGCCTGA TCGCCGCCCT GCCGGCCGCG CTGGCTGTGT GA
|
Protein sequence | MLVVDQKSLA PAVVSPVRPA LEGVTAYKAG MTLAQAGRLA GRVDLSKLAS NENLLGCSPK VADAVQAALA SPEIYPDPYC ETLRAAIGER LKVDPARIVM TPGSEALIDY LFRAVLHPGD SILLSSPTFP AYEIFGRCAE ASFIDVPRLG NFDLDVEAFK VEAAKGPKLL VLCTPNNPTG NALSGVDIGE ILAVTPRSTV VFIDEAYREY HEDFDTLALL KAWGGTWVSG RTFSKAYGLA GMRVGYGICS SAELVGYLDR IRPPFNVTAI SQAAALAAWN DREHLERTVA LTLTERARVE AVLDAMGVER TKSAANFAFL RAKVGADAVA MRLLKEGLIV RPTPVPGGWV RITIGRPADN DRLIAALPAA LAV
|
| |