Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_37020 |
Symbol | |
ID | 7762596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3757265 |
End bp | 3759505 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643806569 |
Product | outer membrane receptor FepA |
Protein accession | YP_002800823 |
Protein GI | 226945750 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.145193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCCGC GTTTCAGGCT TGCGCATCTG CCCCTGGCGT TGCTGGCAGT TTCTTCCCCC TTCCTGGAAG CAGCAGAGGA GACTACTACC GAGGAGGGCG CCGCGGGCGA TGGCTCCTAT CGGGCGGAGG AGGGCGCCCT CGAACTGGAC GACGTGCTGG TCACCGCCGA GCGGGAACTG AAACAGGCTC CCGGCGTTTC CATCATCACC GCCGACGACA TCAAGAAGCG CCCGCCGGTC AACGACCTGT CCGACATCAT CCGCAAGATG CCCGGCGTCA ACCTCACCGG TAACAGCGCC AGCGGCCAGT ACGGCAACAA CCGGCAGATC GATATCCGCG GCATGGGCCC GGAGAACACC CTGATCCTGA TCGACGGCAA GCCGGTGCGC TCGCGCAACT CGGTGCGCAT GGGCCGCAGC GGCGAGCGCA ACACGCGCGG CGACACCAAC TGGGTGCCGG CGGAACTGGT GGAACGCATC GAGGTACTGC GCGGCCCGGC GGCGGCGCGC TACGGCTCGG GGGCCATGGG CGGCGTGGTG AACATCATCA CCAAGGCGCC GACCGAGAAG ACTCACGGCT CGGTCTCCAC CTACTACAAC AGCCCGGAAA GCGAATACGA AGGGCTGAGC AAGCGCTACA ACTTCAGCCT GACCGGCCCG TTGGTCCAGG GCCTTTCCTA CCGCATTCAC GGCAACCTCA ACAAGACCGA GGCCGACCGT CCCGGCCTGA ACTCCGCCTA TACGGCCGAT GGCGGCATCA CCCCGCCAGC CGGCAAGGAA GGGGTGCGCA ACCGCGACCT CAACGGCATG CTGCGCTGGG ATCTGAACTC GCAGCAGACC ATCGAGATCG AGGGCAGCTA CAGCCGTCAG GGCAACATCT ACGCCGGCGA CCGGGCCGTC AGCACCACGG GCTCCGAATT GCTGAACCAG TTGTTCGGCC GGGAAACCAA CATCATGTAC CGCCGCGCCG GCTCGCTCAC CCACCGCGGC AACTGGGACT GGGGTACCTC GCAACTGGTG TTCTCCTACG AGAACACCCG CAACCGGCGC CTCAACGAGG GCCTGGCCGG CAGCAGCGAG GGCAGTATCA ACGCTGGCGA CATGTCCACC TCCGAATACG ACAACTATGC GCTGAGCGGC GAGGTGAACA TCCCCTTCAA CTTCGGCTTC AGCCAGGTGG CGACCCTGGG CTTCGAGTAC ACCAAGGAGG TGCTGGACGA CCCCTTCTCG ATGAGCCAGA GCGCCACCGC GATTCCCGGT ACCGCGACTA CCGGCCGCGA CAGCGAGGCG ACCAACGAGA ACATCGCCTT CTTCGTCGAA AACAACATCC ATCTCACCGA CCGCTGGACC CTGACGCCCG GCGTGCGCTT CGACAACCAC ACCCAGTTCG GCAGCAACTG GAGCCCGAGC CTGAACACCT CCTACCAGTT GACCGACGCC GTCAGCATCA AGGGCGGCCT GTCGCGGGCC TTCAAGGCGC CCAACCTGTA CCAGTCGAAC AGCAACTACA TCTACTACAC CATGGGCAAC GGCTGCCCGG CCGACTCGCC GAACATGGGC GGCGGCTGCT ACGTGCAGGG CAGCGACGAC CTTGACGCGG AAAAGAGCTG GAACTTCGAG CTGGGCGTCG CCTACGCCCA GAACGGCTGG AACGCCGGGG TGACCTACTT CCGCAACGAG TACGAAGACA AGATCGTGGC CGGTCTGACC CCCACCTCCA TCACTACGGC GGACGGGCAG ATCCTCCAGT GGGAGAACGC TTCCAAGGCG GTGGTGGCCG GCTGGGAAGG CACCCTGAAC ATCCCGCTGA TGGGCGTGGA CGGCGACGTC CTGAGCTGGA ACACCAATTT CACCTACATG ATCGAGAACA AGAACAAGAG GACCGGCGAG CCGCTGTCGG TCATTCCCAA GTTCACCATC AACTCGATCC TCGACTGGCA GGCCACCGAG GCGCTCAACC TGAACCTGAG CATGACCCTC TACGGTTACC AGGACCCGCG CAATCTGAGC GGCACCGGGG CCCAGGAAAG CGGCGAGGCC CTCAAGCAGC AGGGCGGCTA CACCCTGTGG GCGGTCAACG GCAACTACGA GCTGACCAAG AACTGGAGCT TCGGCGCCGG CATCAACAAC CTGCTCGATA AGGAAATCAA GCGCGAGGGC AACGCTTCCG GTAGCGGCGG CGCCTCCACC TACAACGATC CGGGCCGGGC CTACTACGCT TCGGCGAAGT TCACCTTCTG A
|
Protein sequence | MYPRFRLAHL PLALLAVSSP FLEAAEETTT EEGAAGDGSY RAEEGALELD DVLVTAEREL KQAPGVSIIT ADDIKKRPPV NDLSDIIRKM PGVNLTGNSA SGQYGNNRQI DIRGMGPENT LILIDGKPVR SRNSVRMGRS GERNTRGDTN WVPAELVERI EVLRGPAAAR YGSGAMGGVV NIITKAPTEK THGSVSTYYN SPESEYEGLS KRYNFSLTGP LVQGLSYRIH GNLNKTEADR PGLNSAYTAD GGITPPAGKE GVRNRDLNGM LRWDLNSQQT IEIEGSYSRQ GNIYAGDRAV STTGSELLNQ LFGRETNIMY RRAGSLTHRG NWDWGTSQLV FSYENTRNRR LNEGLAGSSE GSINAGDMST SEYDNYALSG EVNIPFNFGF SQVATLGFEY TKEVLDDPFS MSQSATAIPG TATTGRDSEA TNENIAFFVE NNIHLTDRWT LTPGVRFDNH TQFGSNWSPS LNTSYQLTDA VSIKGGLSRA FKAPNLYQSN SNYIYYTMGN GCPADSPNMG GGCYVQGSDD LDAEKSWNFE LGVAYAQNGW NAGVTYFRNE YEDKIVAGLT PTSITTADGQ ILQWENASKA VVAGWEGTLN IPLMGVDGDV LSWNTNFTYM IENKNKRTGE PLSVIPKFTI NSILDWQATE ALNLNLSMTL YGYQDPRNLS GTGAQESGEA LKQQGGYTLW AVNGNYELTK NWSFGAGINN LLDKEIKREG NASGSGGAST YNDPGRAYYA SAKFTF
|
| |