Gene Information |
Locus tag | Avin_21220 |
Symbol | csbC |
ID | 7761047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2118693 |
End bp | 2119904 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643805017 |
Product | Isochorismate synthase |
Protein accession | YP_002799298 |
Protein GI | 226944225 |
COG category | [H] Coenzyme transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1169] Isochorismate synthase |
TIGRFAM ID | [TIGR00543] isochorismate synthases |
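
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the reported protein length excludes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2118693, 2119904)
print(length_bp, protein_length(length_bp))  # 1212 403
```

Under these assumptions the computed values agree with the Gene Length (1212 bp) and Protein Length (403 aa) rows.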
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.12454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTG TTCAACAGAC ACAGCTTGCG CAGTCCTTTT CCGAACCCTG CGGTCCGCAG GTGCTGTTCA CCCATTACCA CCGCGATGCC TGCATGTTCG CTTCGCCCGG CCAGACCCTG CTGGCGCATG GCGTGGCCGC GACCCTGCCG CTGGTCATGA GCGGTTCGCC CGCCGCCCTG GCCGGGCATG CGGCCGCTTT TCTGCGTCAG GCCAGCGAGG CCGGGCATGC CGGCTGGCTG ATCGGCGCCA TCCCCTTCCT GCCGGGCGCT GCGGCCCATC TGTTCATCCC CGAGCATGTC GAACTGGCTG GCGGCGGTCG CGCGGCGCTG GTCGGCGGGC CGCGTCCGGT ACGCACGGTC GCGGCGCGCA GCGAACCGGC CGAGGCGGTC TACGAGCAGA ACGTGAGCCG GGCCTTGGAG CGGATCGCCG ACGGGAAGTT GCAGAAAGTG GTGCTGTCGC GCTCGCTGCA CATCCAGGCG GAGCTGGATC AGGCCGAACT GCTGCAGACC CTGGCCAGCC GCAACCCGCT GGGCTACACC TATGCCATTC CGCTGCCGAC CGCCAACGGC GAGCGCCGCA GCCTGATCGG CGCCAGCCCC GAGCTGCTGC TGGCCCGGCA CGGCAACCGG GTGATTTCCA ACCCGCTGGC CGGCTCCATC CCGCATAGCA GCGACCCGGC CGAGGACCGC CAGCGCGCCG AGAACCTGCT GCGCTCGCCC AAGGACCTGC ACGAGCACGC GCTGGTGGTG GAAGCCGTCG CCGAGGCCCT GCGCCCCTAT TGCACCGATC TGCTGGTCCC CGAAGCGCCG TCGCTGCTCT CGACCCCGAC CATGTGGCAC CTGTCCACCG AAGTCACCGG CACGCTGCGC GACCCGGCCA CGACCTCGCT GGAACTGGCG CTGGCGCTGC ATCCGACTCC GGCCGTGGGA GGCTATCCGA CCGCCGAGGC GCGGGAGTTC ATCCAGGCGT TCGAAGGCTT CGACCGGGGC TTCTTCACCG GCCTGGTCGG CTGGTGCAAC GCGCAGGGCG ATGGCGAATG GGCGGTGACC ATCCGCTGCG CCGAAGTCGG CGAGCAGTCG TCGACCCTCT ATGCCGGGGC CGGCATCGTG GCCGGCTCCG AACCGGCGCT GGAACTGGCG GAAACCGCCG CCAAGCTGCG CACCATGCTG GGCGCCATGG GGCTCGCGCT GGCCGAGGAG AACCGGGCAT GA
|
Protein sequence | MSFVQQTQLA QSFSEPCGPQ VLFTHYHRDA CMFASPGQTL LAHGVAATLP LVMSGSPAAL AGHAAAFLRQ ASEAGHAGWL IGAIPFLPGA AAHLFIPEHV ELAGGGRAAL VGGPRPVRTV AARSEPAEAV YEQNVSRALE RIADGKLQKV VLSRSLHIQA ELDQAELLQT LASRNPLGYT YAIPLPTANG ERRSLIGASP ELLLARHGNR VISNPLAGSI PHSSDPAEDR QRAENLLRSP KDLHEHALVV EAVAEALRPY CTDLLVPEAP SLLSTPTMWH LSTEVTGTLR DPATTSLELA LALHPTPAVG GYPTAEAREF IQAFEGFDRG FFTGLVGWCN AQGDGEWAVT IRCAEVGEQS STLYAGAGIV AGSEPALELA ETAAKLRTML GAMGLALAEE NRA
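
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the tool used to produce this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces that split the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGTTTTG TTCAACAGAC ACAGCTTGCG"
print(translate(first_blocks))  # MSFVQQTQLA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content row.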