Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_09680 |
Symbol | |
ID | 7759912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 913772 |
End bp | 917650 |
Gene Length | 3879 bp |
Protein Length | 1292 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643803873 |
Product | AMP-dependent synthetase and ligase family protein |
Protein accession | YP_002798175 |
Protein GI | 226943102 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.540279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGC TCGAAAGGAG CTATCGGCTG CTGGCCGCAC AGGAGGGTTT TTATTATGGT CATGAGTTGC GTGATAATAA AGCTGATTTC AATGTGGCTG AATATGTTGT TATTGAAGGC GAGTTTGATG TTGATCGGTT CTATCAAGCC AATAAAAGCG TGCTGGCAGA TATAGAGACG TTGAGCTTTG TTTTTCGCTC TCATGATGGC CATGGTAGGC AGCAACACTT GCCGCGTCCA GTTAAGCTCG AGTTGGTTGA CCTGCGTGGT CTCTCAACAG GTTTCGAGAT TGCTTTGAGC CAAATGACGG TGGACTCAGC TAAGCCCTTC GATATCACTA ACGATGTTTT GTACCGTCAA TGCCTGTACA GGATCAGCGA CAGCTGTCAA CTCTGGTACT TCTGCAGTCA CCATCTCGTA CTTGACGCTT TTGCCATGCT TCAGGTCTTA AGTCGGGTGG CCAAAGCTTA TCGCTCTGTA GGAGCAGTAG TAATCGACCG GACAGCCTCG AATTTTTCTG CCGTCGTATC TGCAGAAGAT GCCTATCTGG AATCAGCCAC TTATGCTCTC AACTATGATT ATTGGCAACA CCAAGTGTCC TTGCTGCCGG AACCAACAAG CCTCAGCCCG AAGCCACAAC ATGCAGGAGA GTGCCTGCGT TTCTCCTGTG AAGAGCCTGT GCCGACGGGG TGGTTTGGCC TCATGGCGGA GAAGGTAGAT TGGCCAAGTC GATTGATAAC GAGCTTGCTG GTCTATCTGT GGATGATGAG CGGAAATCGT AGGCAGGTTA TTGGTATCCC CATGCTAGCC CGGAAAGCGC CAGCCGAATG GGCGGTCCCC GCTTGCCGCG TGAATGTCTT GCCACTGTCC GTCGAGCTTT GCGCTGAGGC CAGCGTCGAG CAGACCCTGG CAGCTGTCAG TGACCGCCTC TCGCAACTCA AGCGTCACCA GCAGTATCCC GCTGGAGCGC TGTCCAGTGG CCTGCAATCC TTGCCCTTCA ATACCGTGCT GAATATCTTG CCCTTCACGC CCCACGTTAT GTTTGACATA CAGCAGGAAA GTGAAGTGCG GAACATCAGG GCGGGTGCAG TCAACGACAT CGCTTTCACT GTCAGGCCTG ACTTGGCCGG GTCAAGGCTG CATGTCACCG TCGATGCCAA TGGCCGCTTG TACTCCGCGA GCGACCTGCA AATGCATGTT AGTGGATTGT TGAACGTCGT TCGCACCCTG ATGGCGCAAC CGCAAATCCA TCTGTCGGCG CTGCCTAGGC CTAGGCCCAT GGCTTGGCTG GAAGGGGCTC GAGCTTGCGC ACGTCAGGAT GTACTCAGTT CGATTTTCGA CCGTGCTCAG AACCAAGGAA AGCAGGTCGC GCTGACAGAC ATCTGCAATC CACAGGAGAA CTGGCAAAGT ATCACTTATC AGGTGTTGGT TGATGAGGTC GAGCGGTTGG CTCATTCTTT GCGAGAGTGT GCGGCTTTTG ATCTGTTAGT ACTAGATTTG CCTCGCTCCA GCGATTCCAT CCTTTTCATG CTGATAGCTC TGCGCTTGGG TGTGCCTTTC GTCGTTCTCA ATCCCATGAG GGCGCAACAG ATGCTAGAGG TCCTGGTCCA GCATGCTACT CACGCCTTGC TGGTGGCCAG GCAGGAAGTA TCTAGCGACT CTGCCGGGTG GCAGGAAAGT GCGCAGTACC TGCAGATAAA AGGATATGAG GCACTGCGCT TCACGCGTTT GCAGGGTGAG GGCACAAGCG GATTCGGTGC GCTTGCCTAC TTGATGTTCA CATCAGGTAC CGCTGGTAAG GCCAAGGCAG TGAAGATATC TAGGCAAGCA ATGGATGCGT TTGTCGGCAC GGCGGTTAAT CGATATGGCT TCGACACTAC GGATGTAGTA CTCAACTTCT CGGAGCATTT TTTCGATGCC TGCATTGAGG AGATTTTCGG GGCATTGACC GCCGGTGCTC GGCTGGTGAT TCGTCCAGCC GATGCTCATA GGTCGACGCG GCATTTCCTC AGCTTCTGTG CTATGCAGGC GATCACTGTC CTTGACTTAC CCACAGCCTA TTGGCATGAA ATGGCTATTG CCATGGATGC CCATATGATG AAGCGTACCC GGGTACGCCT TGTGGTCATC GGTGGGGAGC GTGTGTCCGA GCAGGCTCTA TACCATTGGT TCGAGAAAGT CGGTGATAGT CCCCAGCTAT TCAACTCCTA TGGGCCGACA GAGGCTACTG TAGTTGCCAC CTGTGCATTG CTGGACAAGG CAGAAGGGGC TTCGATAGGG CGCCCCTTAG CCGGAGTGCA GTGCGCGATC CTGGGCCCTG GGCTAACACT TCTGCCACAA GGAGCGAGTG GTGAGCTGCT GCTATTCGGT GATACGCTCG CCGAAGGTTA CCTAGGTGAT GCAGAACGCA CTACAGAAAG ATTCGTCGAG ATCTGCATCG ATGGAAAAAT GCGTCGTGGC TACCGCACTG GTGATCTAGC ACTCCTAGGC AGGGACAACC AGTTGCACTA CATGGGCCGC CTAGACCATG AGGTGAAGAT CGCCGGTCAA CGGCTGAATC TGTCTGAGCT GGAGGCCTTG ATCGAAGCCT GTGCCGGAGT CAGGGAAGTC TGCGTCGTAC TGCAAGAAAG TATCTTTCCT CGGTCACTGA GCGCGCATAT CTATGGTGAT CCAGCACTAG AGCAGACGGT TCGCCGAGAC TTGGGCTCTC AGATAGCGCC TGAGTTCCTG CCGCGCAGCT ATCACTGGCA TGCCTCGCCA CTTCCCAAAA CTTTCTCGGG GAAAATCGAT CGTGCCCAGC TCCAGCGCCA TGGCAAAAAG ACGCCCCCCT TGTCCTTGCC AAAGATTGTT GAGCGGCAGG GCTGGCTCGA GCAGCATGTG CGCATGATCT GGTGCGAAAG CTTGGGAAGC AATGAGGCTC TCGATTTTTT TCAGCAGGGC GGTGACTCGT TAATGGCATT GCGTATGGTC AACCGGCTGA ATGCCGAACT CGGCTTGGCA TTGACTATGG AGACGGTGTT TGATTCGAAG ACCCTGAAGG GACTACTTGC CTATATTGTT CGGGAAATAA CGAAACTCGG TTTGCAAGTT GATTGCTTGG AACTGCGCAA TGCATTGTTG AAACCACCTT CTCCTGTGCA ATGGTCAGCT CGGCGCACGG TTTTCCTCGA AGCGCATCCG CTGCACCACT TTGCTGATCT GCAAGGATTG GCCGAAAAGC TGGACCTTTA CTTGGTCGTG AGTCGTAATG ACGATGCTCA GCCAGTGGGC AAAGACTGCT CTGTCCAAGA CAGGCTGTAC TGGCACGATG TACAGGGCGG CTTATGGCAT CAAGGCCAAG CTTGGCCTGT GACACAGATA CCCGCAGTGC AATACGCAAT CTTCATCATG CCGCACCGTC CAGAAGCCTA TCCGGCTTGG TCGCAGCGAT GCCTGGAAAG GCTCAGGCAA CTACAGATGC ACAATCTGGA GCGTGCACTG ATCGTTACCG ATCGGGAGGG CCGGCGCTGG TTCGAAAAAC ACATCTTGTC GGCTGCGGAA TGCAAACGGC TGGAATTTCA CACATGTTCA CCTCACGCGC TGCAGGAGCG GCAACATCAG GAAGAACTGA TCGTGGACTG GTCTGTGCGT ATAGGTTACT TCCCAGATCT GAACTTGCCG CCATGCTTCG TAAGCATGCG TAATCTGCTT GCTGAATTAG CCGGCCGGCC AGGGCGGGTA CGCATGTCTG TTCGGGATCG CCTGTTCGCA CGAGCGCAAA GTTTCAACCC CGATGTGAAA GTCTGTAGCT TTGCCCAATG GCTGAAAGAA ATCACTCGAT ACTCAGGGAG CCCCTGCCAT AGGTTGGAGC AGCGCACTGT ATACCGTTTG AACTTGTGA
|
Protein sequence | MGMLERSYRL LAAQEGFYYG HELRDNKADF NVAEYVVIEG EFDVDRFYQA NKSVLADIET LSFVFRSHDG HGRQQHLPRP VKLELVDLRG LSTGFEIALS QMTVDSAKPF DITNDVLYRQ CLYRISDSCQ LWYFCSHHLV LDAFAMLQVL SRVAKAYRSV GAVVIDRTAS NFSAVVSAED AYLESATYAL NYDYWQHQVS LLPEPTSLSP KPQHAGECLR FSCEEPVPTG WFGLMAEKVD WPSRLITSLL VYLWMMSGNR RQVIGIPMLA RKAPAEWAVP ACRVNVLPLS VELCAEASVE QTLAAVSDRL SQLKRHQQYP AGALSSGLQS LPFNTVLNIL PFTPHVMFDI QQESEVRNIR AGAVNDIAFT VRPDLAGSRL HVTVDANGRL YSASDLQMHV SGLLNVVRTL MAQPQIHLSA LPRPRPMAWL EGARACARQD VLSSIFDRAQ NQGKQVALTD ICNPQENWQS ITYQVLVDEV ERLAHSLREC AAFDLLVLDL PRSSDSILFM LIALRLGVPF VVLNPMRAQQ MLEVLVQHAT HALLVARQEV SSDSAGWQES AQYLQIKGYE ALRFTRLQGE GTSGFGALAY LMFTSGTAGK AKAVKISRQA MDAFVGTAVN RYGFDTTDVV LNFSEHFFDA CIEEIFGALT AGARLVIRPA DAHRSTRHFL SFCAMQAITV LDLPTAYWHE MAIAMDAHMM KRTRVRLVVI GGERVSEQAL YHWFEKVGDS PQLFNSYGPT EATVVATCAL LDKAEGASIG RPLAGVQCAI LGPGLTLLPQ GASGELLLFG DTLAEGYLGD AERTTERFVE ICIDGKMRRG YRTGDLALLG RDNQLHYMGR LDHEVKIAGQ RLNLSELEAL IEACAGVREV CVVLQESIFP RSLSAHIYGD PALEQTVRRD LGSQIAPEFL PRSYHWHASP LPKTFSGKID RAQLQRHGKK TPPLSLPKIV ERQGWLEQHV RMIWCESLGS NEALDFFQQG GDSLMALRMV NRLNAELGLA LTMETVFDSK TLKGLLAYIV REITKLGLQV DCLELRNALL KPPSPVQWSA RRTVFLEAHP LHHFADLQGL AEKLDLYLVV SRNDDAQPVG KDCSVQDRLY WHDVQGGLWH QGQAWPVTQI PAVQYAIFIM PHRPEAYPAW SQRCLERLRQ LQMHNLERAL IVTDREGRRW FEKHILSAAE CKRLEFHTCS PHALQERQHQ EELIVDWSVR IGYFPDLNLP PCFVSMRNLL AELAGRPGRV RMSVRDRLFA RAQSFNPDVK VCSFAQWLKE ITRYSGSPCH RLEQRTVYRL NL
|
| |