Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0045 |
Symbol | |
ID | 3747244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 51249 |
End bp | 52259 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637772571 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_378367 |
Protein GI | 78188029 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.266308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCTC CTCGCTCTTT TATTACTCGC CTAATTCTTG CGGTTACTTT GCTGATAGCC GCATTCCCGC TTGGCTTTTT GTTGATACCG GGACTTAATA GTAAGAGCAA ACCAACGCAA TTAGTGGTGC ATCGTGAAAT GCGCTTTAGC GATGTGCTCG ACAAGTTGCA AGCAAGTGGC GCAATTCGTG AACGGTGGCA GCCAGAGCTA ATTGCACGCA TGGTACCAAA ATTCCGCACG ATAAAAGCTG GACGCTACAC CATTCCCCCC AACACCTCGA ACTTTGGCTT ACTGTGGTAC CTCCGCACGC ACCCGCTTGA CGAAGTGCGC GTTACCCTGC CCGAAGGTAT TGATAGACGC AAAATGGCAC GCATTCTTTC GCGCAAGCTT GATTTTGACT CCACGCAGTT TATGGCTGCA ACCGAAAATC CTCGTTTACT TGCAAAATAT GGCATTCGTG CCAGCCACGC CGAAGGCTAC TTGCTACCAG GAACCTATGA TTTTGCATGG GGCAGCTCAC CTGATGAAGC GGCAAGCTTT CTTATTCGTC AATTCAAAAA ATTGTACACC ACCGAACGGC AGCAACGCGC CGCAGCGCTT GGCTTTAACG AGCATAGCCT CCTGACGCTT GCCTCCATTG TAGAAGCCGA AACGCCGCTT GATAAAGAAA AGCCCACGGT AGCCAGCGTT TACTTACATC GCTTACGCAT TGGAATGCGC TTGCAAGCTG ACCCTACCGT CCAGTACGCC CTTGGAGGCA CCACACGCCG CTTGTACTAC AAAGACCTTG CAATTGCCTC CCCCTACAAC ACCTACCGCA ATAAAGGTTT ACCCCCTGGA CCTATCTGCA ATCCGGGCAA AGCTTCAATT ATAGCCGTCT TAAACGCCCC TCAAAGTGGC TATCTCTATT TTGTAGCAAC AGGTACAGGC GGACATTACT TTGGCGCATC GCTACAAGAA CATCATGCAA ATGTACAGAA GTATAAACAG GCACGTAGTA GTAACGAATA A
|
Protein sequence | MKSPRSFITR LILAVTLLIA AFPLGFLLIP GLNSKSKPTQ LVVHREMRFS DVLDKLQASG AIRERWQPEL IARMVPKFRT IKAGRYTIPP NTSNFGLLWY LRTHPLDEVR VTLPEGIDRR KMARILSRKL DFDSTQFMAA TENPRLLAKY GIRASHAEGY LLPGTYDFAW GSSPDEAASF LIRQFKKLYT TERQQRAAAL GFNEHSLLTL ASIVEAETPL DKEKPTVASV YLHRLRIGMR LQADPTVQYA LGGTTRRLYY KDLAIASPYN TYRNKGLPPG PICNPGKASI IAVLNAPQSG YLYFVATGTG GHYFGASLQE HHANVQKYKQ ARSSNE
|
| |