Gene Information |
Locus tag | Cagg_3801 |
Symbol | |
ID | 7267875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4637015 |
End bp | 4638109 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643568609 |
Product | chorismate synthase |
Protein accession | YP_002465073 |
Protein GI | 219850640 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.209902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.33378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGAA ATAGCTTTGG TCACGTCTTT CGGCTGACAA CGTGGGGTGA ATCGCATGGC CCGGCAGTGG GGTGTACCGT AGATGGTTGC CCGGCCGGGT TGCCGCTCGA TGTGGCCGAT ATTCAACGCG AACTCGACCG GCGGCGGGTT GGTCAAAGCC GGGTCAGTTC GCAACGGCGC GAAGCTGATG AGGTACAGAT ACTCTCCGGT GTGTTTGAGG GTCGCACCAC CGGAACGCCG ATAACGATGG TTGTTTACAA TACCGATGCC AAATCTCACC ACTACGATAC TATCAAAGAC GCCTACCGTC CCGGTCACGC CGATTATACG TGGGACGTAA AATACGGTTT TCGGGATTGG CGTGGTGGTG GGCGTTCGTC AGCCCGCGAG ACGATTGGGC GGGTAGCCGG TGGTGCAATT GCGCGCAAAC TGTTGGCGAC GGTGGGGGTA ACAATTGTAG GGTATACCCT CCAACTAGCC GATTTGCGCG CCGAGGTCTT TGATGAAGCA GAGATCGAAC GCAACATCAT GCGGTGCCCT GATGCGCGGG TGGCGGCGTT GATGGTTGAA CGTGTCGATC AGGCGCGTCG CGAACTCGAT TCGCTGGGTG GGATCGTTGA AGTCCGGGCG CGAGGTGTAC CTCCCGGCCT CGGTGAGCCG GTGTTTGATA AGCTCCAAGC CGATATCGGT AAGGCCATGT TCTCGATTCC GGCTATCAAA GGAGTGGAGA TTGGTGAAGG GTTTGGGGTG GCAATGCTGC GTGGCTCGCA GAACAACGAT CCCTTCATCC GGCGCGAGGA TGGTTCAATC GGTACGACCT CGAACCATCA CGGCGGTATT CTCGGCGGCA TTTCAACCGG CGAAGAGATC GTGGTACGAT TGGCAGCCAA ACCACCGGCC AGTATTGCCC GCCCACAACA AACGGTCGAC CGCGACGGTA ACCCGGTAAC GATTGAGGTG CATGGTCGCC ATGACCCAAC GGTCTTGCCG CGTCTCGTGC CGGTGGCCGA AGCTATGCTG GCGTTGGTGC TGGCCGATCA TCTGTTGCGA CAGCGGCTTG CTCGGGTGTC GTGGTCGGAG CGTGATGATG GGTAA
Protein sequence | MPGNSFGHVF RLTTWGESHG PAVGCTVDGC PAGLPLDVAD IQRELDRRRV GQSRVSSQRR EADEVQILSG VFEGRTTGTP ITMVVYNTDA KSHHYDTIKD AYRPGHADYT WDVKYGFRDW RGGGRSSARE TIGRVAGGAI ARKLLATVGV TIVGYTLQLA DLRAEVFDEA EIERNIMRCP DARVAALMVE RVDQARRELD SLGGIVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK GVEIGEGFGV AMLRGSQNND PFIRREDGSI GTTSNHHGGI LGGISTGEEI VVRLAAKPPA SIARPQQTVD RDGNPVTIEV HGRHDPTVLP RLVPVAEAML ALVLADHLLR QRLARVSWSE RDDG
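
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence above ends in TAA). The function names are illustrative and are not part of the source database.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based inclusive,
# and the CDS length counts the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    # Values taken from the Gene Information table above.
    length_bp = gene_length(4637015, 4638109)
    print(length_bp)                  # 1095
    print(protein_length(length_bp))  # 364
```

Under these assumptions, both results agree with the Gene Length (1095 bp) and Protein Length (364 aa) fields.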