Gene Avin_21210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21210 
SymbolentE 
ID7761046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2117050 
End bp2118696 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content71% 
IMG OID643805016 
Productenterobactin synthetase component E (2,3-dihydroxybenzoate-AMP ligase) 
Protein accessionYP_002799297 
Protein GI226944224 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0419167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCT CGACCCCGAC CTCTTCTTCC GCCGAAGGCG CGGACTTCAC GCCCTGGCCC 
GCCGAATTCG CCGCGCGCTA CCGGGCGGCC GGCTACTGGC GCGGCGAACC GCTGGACAGC
CTGCTGCGCG CCGGCGCCGA CCGCCATGCC GGACGCACCG CGCTGGTCTG CGGCGAGCGG
CGCTGGACCT ACGCCGAACT GGACGCCCGC GTCGACCGGG TGGCGGCCGG GCTGGTCGGA
CAAGGCATCG CCGCCGGCGA CCGCGTCGTG GTGCAGTTGC CCAACATCGC CGAATTCGTG
ATGGTCATCT TCGCCCTGCT GCGCCTGGGC GCCCTGCCGG TCTTCGCCCT GCCGGCCCAC
CGCCGGGCCG AAATCGGCTA CTTCTGCGCC TTCGCCGAAG CCAAGGGGCT GGTCATCCGG
GATCGCCACG CCGGTTTCGA CTATCGCCAG ATGGCCCGCG ACATCCGCGA CGAAGCAGCG
ACCCTGAGCA CCGTCGTGGT GGTCGGCGAG GCCGAGGAAT TCATCCCCTT CGAGCGGCTC
GACGCCGAGC CGCTGCCGCT GCCGGAACCC AAGGCCGACA CGCTGGCCTT CCTGCAACTG
TCGGGCGGCA GCACGGGACG GCCGAAGATG ATCCCGCGCA CCCACGACGA CTATTTCTAC
AGCGTGCGGG CCAGCGCCGA GATCTGCGGC CTCGGCCCGG ACACCGTGTT CCTCTGCGCC
CTGCCGGCGG CCCACAACTT CGCGATGAGT TCGCCGGGCA TCCTGGGCGT CCTCTACGCC
GGCGGCAGCG TGGTGCTGGC GCCCGATCCC AGTCCCGACA CCTGTTTCGC CCTGATCGCC
CGCGAGCGGG TCGACATGAC CGCGCTGGTG CCCTCCGTGG CGCTGGCCTG GATGGAGGCC
GCGCCGGCCC GGCAGGCCGA ACTGGCCAGC CTGAAGGTGC TGCAGGTCGG CGGTTCGCGC
CTCAGCGACG AAGCCGCGCA ACGGGTCGAC AGCCTGCTCG GCTGCAAGCT GCAACAGGTG
TTCGGCATGG CCGAGGGACT GGTCAACTAC ACCCGGTTCG ACGATCCCCA GGAGCTGATC
GTCGGCACCC AGGGCCGCCC CATCTCCCCG GACGACGAAG TGCGCATCGT CGACGACGAG
GACCGCGACG TGCCGCCGGG CGAAACCGGG CACCTGATCA CCCGTGGCCC CTACACCATT
CGCGGCTACT TCCGCGCCGA TGTGCACAAC GCCCGCTCCT TCACCCGCGA CGGCTTCTAC
CGCACCGGCG ATGTGGCCCG CCGCCTGCCC AGCGGGCACC TGATCGTCGA GGGCCGCGAC
AAGGACCAGA TCAACCGCGG TGGCGACAAG GTGGCCGCCG AGGAAGTGGA AAACCACCTG
CTGGCGCATC CCGCCGTGCT GGATGTCGCC GTGGTCGCGA TGCCCGACGC CTTCCTCGGC
GAGCGCACCT GCGCCTTCAT CGTGCCGCGC GGCGAAGCGC CCCGGCCGCT GGAGATCAAC
CGCTTCATGC GCGAACGCGG CGTCGCCGGC TACAAGGTGC CGGACCGCAT CGAGTTCGTC
GACCAGTTGC CCAAGACCGG CGTCGGCAAG ATCGACAAGC GCGCCCTGCG CGAACGCATC
GCGGCACGCC TGCAGGCCAC GGCCTGA
 
Protein sequence
MSTSTPTSSS AEGADFTPWP AEFAARYRAA GYWRGEPLDS LLRAGADRHA GRTALVCGER 
RWTYAELDAR VDRVAAGLVG QGIAAGDRVV VQLPNIAEFV MVIFALLRLG ALPVFALPAH
RRAEIGYFCA FAEAKGLVIR DRHAGFDYRQ MARDIRDEAA TLSTVVVVGE AEEFIPFERL
DAEPLPLPEP KADTLAFLQL SGGSTGRPKM IPRTHDDYFY SVRASAEICG LGPDTVFLCA
LPAAHNFAMS SPGILGVLYA GGSVVLAPDP SPDTCFALIA RERVDMTALV PSVALAWMEA
APARQAELAS LKVLQVGGSR LSDEAAQRVD SLLGCKLQQV FGMAEGLVNY TRFDDPQELI
VGTQGRPISP DDEVRIVDDE DRDVPPGETG HLITRGPYTI RGYFRADVHN ARSFTRDGFY
RTGDVARRLP SGHLIVEGRD KDQINRGGDK VAAEEVENHL LAHPAVLDVA VVAMPDAFLG
ERTCAFIVPR GEAPRPLEIN RFMRERGVAG YKVPDRIEFV DQLPKTGVGK IDKRALRERI
AARLQATA