Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4144 |
Symbol | |
ID | 8335498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4683950 |
End bp | 4685179 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957247 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_003114849 |
Protein GI | 256393285 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0208419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGACG AGGAGCCCGG AGGGCTCGGC GGGTTCGCGG ACGAGGTCTC CCGCATGCTG CATGCCAGGG CGCTACTACA CCAGGAACAC ACCGACCCGG TCGGGATCAC CAGGAGGAAC CTGAAGATGG CTAAGCAACG TCGGCGTGCG GTGCTCGGAA CGACCACGAC GCTGGCGGTG ACCGGCGGTG CGGTGTTCGG CGTCTACGCA CTGTCCGGGC ATGCGCAGGA CAACGGCGGG ACTCGCAGCG TGGGGCAGAA CGTCGCTGCC GACTACACCG GGGCGGCGAC CTGCCCGGCG ACCGCGAAGG TCCCGGTGGA CGTCCCGCGC GGCGCGACCA CCACGCAGAT CGCGAACGCG CTGTTCACCG CCGGTGTGGT GGCCAGTCCG CAGGCGTACG TCGATGCTGC TGACCGGAAT CAGGGCTCTG TCGGCATCAC CGCCGGAACC TATGCGATCT GCCCGCAGAT CTCCGGCGCC AACGCGGTGC TGGAGCTGTC GAAGAAGTCG AACCTGTCGG ACGCCTCGCA GATCATCGTG ACCTCCCACG AGTGGTCGAA GGACGTCATC GCGAGCTTGG TCGACAAGCG GAAGTGGAAG CAGGCCGACT TCGACGCCGC GATCGCGAGC AACACGATCG GGCTGCCGGC GTGGTCGGTG GACTCCACGA GCCACAAGTT CACCGCCGAG GGCATGCTGG AGCCGGGGAC GTACTCGATC ACGTCGTCCG ACACGCCGCA GAGCATCCTG TCGCAGATGG TCGCCAAGCG GATGACGTAT TTCAAGAGCA TCGACTTCGA GAACAAAGCT GCGAGTCTGG TCTGTGGCGC CGCGAAGTGC ACGCCGGAGC AGGTGCTGAC GATCGCCTCG ATCGCCGAGG GCGAGGTCGC CGAACCCGGT GACGGCGCCC GCGTCGCCGA GGGTGTCTAC GCGCGCTTGA AGGCCGGGGA CTATCTCGCC GTGGACTCCA CGGCGCTGTA CGCCATCGGG CACCTCCCGG CCGGCCAGCT TCCGTCTGCC AAGCAGGTCC AGGATCCGAA CAACCCGTAC TCGACCTACG CGCCGCACCA CGGTCTGCCG CCGACGCCGG TCTACATCAC GTCCGACGAC ATGATCAAGT CCGCGCTCGC GCCGACCCAC GACGGCACCT ATTACTGGTG CGTCACCTCA ACCGGTGCCC GCTTCTTCAC CAAGGGCCAG GAGACGCAGC GCGATCAGGG CTGCTCGTAA
|
Protein sequence | MRDEEPGGLG GFADEVSRML HARALLHQEH TDPVGITRRN LKMAKQRRRA VLGTTTTLAV TGGAVFGVYA LSGHAQDNGG TRSVGQNVAA DYTGAATCPA TAKVPVDVPR GATTTQIANA LFTAGVVASP QAYVDAADRN QGSVGITAGT YAICPQISGA NAVLELSKKS NLSDASQIIV TSHEWSKDVI ASLVDKRKWK QADFDAAIAS NTIGLPAWSV DSTSHKFTAE GMLEPGTYSI TSSDTPQSIL SQMVAKRMTY FKSIDFENKA ASLVCGAAKC TPEQVLTIAS IAEGEVAEPG DGARVAEGVY ARLKAGDYLA VDSTALYAIG HLPAGQLPSA KQVQDPNNPY STYAPHHGLP PTPVYITSDD MIKSALAPTH DGTYYWCVTS TGARFFTKGQ ETQRDQGCS
|
| |