Gene Avin_21220 details

Gene Information

Locus tag: Avin_21220
Symbol: csbC
ID: 7761047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2118693
End bp: 2119904
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 70%
IMG OID: 643805017
Product: Isochorismate synthase
Protein accession: YP_002799298
Protein GI: 226944225
COG category: [H] Coenzyme transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1169] Isochorismate synthase
TIGRFAM ID: [TIGR00543] isochorismate synthases
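
The start, end, gene length and protein length reported for Avin_21220 can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2118693, 2119904)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both values agree with the Gene Length and Protein Length fields in the record.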


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.12454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTG TTCAACAGAC ACAGCTTGCG CAGTCCTTTT CCGAACCCTG CGGTCCGCAG 
GTGCTGTTCA CCCATTACCA CCGCGATGCC TGCATGTTCG CTTCGCCCGG CCAGACCCTG
CTGGCGCATG GCGTGGCCGC GACCCTGCCG CTGGTCATGA GCGGTTCGCC CGCCGCCCTG
GCCGGGCATG CGGCCGCTTT TCTGCGTCAG GCCAGCGAGG CCGGGCATGC CGGCTGGCTG
ATCGGCGCCA TCCCCTTCCT GCCGGGCGCT GCGGCCCATC TGTTCATCCC CGAGCATGTC
GAACTGGCTG GCGGCGGTCG CGCGGCGCTG GTCGGCGGGC CGCGTCCGGT ACGCACGGTC
GCGGCGCGCA GCGAACCGGC CGAGGCGGTC TACGAGCAGA ACGTGAGCCG GGCCTTGGAG
CGGATCGCCG ACGGGAAGTT GCAGAAAGTG GTGCTGTCGC GCTCGCTGCA CATCCAGGCG
GAGCTGGATC AGGCCGAACT GCTGCAGACC CTGGCCAGCC GCAACCCGCT GGGCTACACC
TATGCCATTC CGCTGCCGAC CGCCAACGGC GAGCGCCGCA GCCTGATCGG CGCCAGCCCC
GAGCTGCTGC TGGCCCGGCA CGGCAACCGG GTGATTTCCA ACCCGCTGGC CGGCTCCATC
CCGCATAGCA GCGACCCGGC CGAGGACCGC CAGCGCGCCG AGAACCTGCT GCGCTCGCCC
AAGGACCTGC ACGAGCACGC GCTGGTGGTG GAAGCCGTCG CCGAGGCCCT GCGCCCCTAT
TGCACCGATC TGCTGGTCCC CGAAGCGCCG TCGCTGCTCT CGACCCCGAC CATGTGGCAC
CTGTCCACCG AAGTCACCGG CACGCTGCGC GACCCGGCCA CGACCTCGCT GGAACTGGCG
CTGGCGCTGC ATCCGACTCC GGCCGTGGGA GGCTATCCGA CCGCCGAGGC GCGGGAGTTC
ATCCAGGCGT TCGAAGGCTT CGACCGGGGC TTCTTCACCG GCCTGGTCGG CTGGTGCAAC
GCGCAGGGCG ATGGCGAATG GGCGGTGACC ATCCGCTGCG CCGAAGTCGG CGAGCAGTCG
TCGACCCTCT ATGCCGGGGC CGGCATCGTG GCCGGCTCCG AACCGGCGCT GGAACTGGCG
GAAACCGCCG CCAAGCTGCG CACCATGCTG GGCGCCATGG GGCTCGCGCT GGCCGAGGAG
AACCGGGCAT GA
 
Protein sequence
MSFVQQTQLA QSFSEPCGPQ VLFTHYHRDA CMFASPGQTL LAHGVAATLP LVMSGSPAAL 
AGHAAAFLRQ ASEAGHAGWL IGAIPFLPGA AAHLFIPEHV ELAGGGRAAL VGGPRPVRTV
AARSEPAEAV YEQNVSRALE RIADGKLQKV VLSRSLHIQA ELDQAELLQT LASRNPLGYT
YAIPLPTANG ERRSLIGASP ELLLARHGNR VISNPLAGSI PHSSDPAEDR QRAENLLRSP
KDLHEHALVV EAVAEALRPY CTDLLVPEAP SLLSTPTMWH LSTEVTGTLR DPATTSLELA
LALHPTPAVG GYPTAEAREF IQAFEGFDRG FFTGLVGWCN AQGDGEWAVT IRCAEVGEQS
STLYAGAGIV AGSEPALELA ETAAKLRTML GAMGLALAEE NRA
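
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before the reported length or GC content can be checked. The following is a minimal sketch of that clean-up, assuming an uppercase A/C/G/T alphabet; the helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in ("G", "C"))
    return gc / len(seq)


# First row of the gene sequence listed above, used here as sample input only.
first_row = "ATGAGTTTTG TTCAACAGAC ACAGCTTGCG CAGTCCTTTT CCGAACCCTG CGGTCCGCAG"
seq = clean_sequence(first_row)
print(len(seq))          # 60 bases in one full row
print(gc_fraction(seq))  # GC fraction of this row only, not of the whole gene
```

Passing the complete gene sequence through the same two functions gives values that can be compared with the Gene Length (1212 bp) and GC content (70%) fields in the record.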