Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43940 |
Symbol | |
ID | 7763267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4442595 |
End bp | 4444742 |
Gene Length | 2148 bp |
Protein Length | 715 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643807249 |
Product | TonB-dependent receptor family |
Protein accession | YP_002801490 |
Protein GI | 226946417 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.204907 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGA TCGCCCGTCC GACATCGTTC CGGGACGGCC TGTTCCGGGG TTCGCCGGGC AAAGTCGCCA TGGCCTTGTT ATTGGCCCAT GGGCAATTGT ACGCGCAGGA GGACAAGAAC GAAGAAAACC CCACCCTGGG CACGGTGGTC GTCCAGCATG AAAGACTGAC CCCGGCGGAA CAGGCCAGGA CCCGGATCGA AAACATTCCG GGCGGCGCCA GCGTCGTCGA CAGCGCCCAG GTGGAACTGG GCAAGGCGGC CACCGTGCAG GACATCCTGG CCTATCAGCC GGGCGTATTC GTGCAGAGCG TCGGCGGCAA CGATGCCATC AAGGTGTCCA TCCGCGGCTC CGGCATCCAG TCGGCGCCGG GCAACATGAC CGAGGGGATC AAGTTTCTTT TCGACGGCCT GGCGCTGACC GGGCCGGGCG GCACCTCCTA TGAACTGTTC GAGCCGCTCG GACTCGACCA TACGGAAGTG CTGCGCGGCG CCAACGCCTT CGACTACGGC GCGGTGACGC TGGGCGGCGC GATCAACTTC GTCAGCGCCA ACGGCCTGAA CGCGCCGGGC ACCCGGGTGC ACGTCGAAGG CGGCAAGTAC GGCTACCGCA AGGCGTTCGC AGGCACCGGC GGTACCCTGG GCGACGCCGA CTACCATTTC AGCGTGAAGG AATCCCGCCG CGACGGCTTC CAGCGGCAGA CCTTCAAGCG GGCCGAGGGC CTGTCGGCCA ACCTGGGCTA CCGCTTCAGC CCCGACCTGG AGTCGCGGCT GATCCTGCGC TACAACGAGG AGTTCCACGA GCAGTCCGCG CCGCTGACCC GCGCCCGGTT GCACCACGAT CCCTCGGACA CCACCAGCGC CTACCGGGTC TCGCGCAGCA ACGTGGACAA GTACGGCTCC TTCCTGGCCG GCCTCAAGAC CACCTGGCAG ATCGACGACC ATTCCCTGCT GGAAGTCGGG CTGGCCTACT ACAAATACCC ACAGAAACTG AACCGCTACA GCACCACGCC CAGCACCTCG GATTACCGCG ACCTCAACCC CTCGATCCGC TACATCCGCA ACGACACCCT GTTCGGCCTG CCCAGCGAGA CCATCGCCAG TTGGTACTAC ACCCGGCACG TCAAGGGCGA GACCGACGCC TACCAGAGGA ACGCCGACGG CTCCCTGACC CATACCAAGA AGAGCCGCTA CAAGGGCTCC TACGACAACG TGTTCGTGCT GGGCAATACC CTGGACCTGA CCCGCGACCT GAAACTGCTG ACCGGGCTCA CCGCCATCCA GGTCCGGCGC GACGTCGAGT TGGCGTATTC GGCGGCACCG ATCAACAGCG GCTACTCCGA CCGGGTCAAC TACGACAACT GGAGCCTGGC ACCGCGCCTG GGGCTGAGCT GGCAGCTCAA CCCGACCCTG CAGGTGTTCG CCAACGCCAG CCGCTCGATC AACCCGGTGG CCACCTGGGG CTACTCGCCT TCCACCTTCA GCACCGCGGT CAACTTCGTC AAGCAGTTGG AGGAGCAGAA GGCGAACACC CTGGAAGTGG GTCTGCGCGT CAAGACGGAG CGCCTCGACG GCAGCCTGGT TCTCTACCGC TCCTGGCTGC GCGACGAACT GCTGACGGTG GAACTCAGTC CCACCACCGA GACCTCCTCG GCGCTGGTCA CCACCTCCAA CGCCGACAAG ACCATGCACC AGGGCGTGGA GGCCGGACTG GCGGCGACCC TGTGGGAGCA TGGCGGCCAC CGCTTGAGCC TGCGCCAGGC CTACACCTTC AGCGACTTCC GCTTCCGTGG CGACGACTCC TTCGGCTCCA ACTACCTGCC CAACGTGCCG CGCCACATCT ATCAGGCCGA ACTGCACTAC CAACAGGCCG GCGGCTTCTA CGCCAGCCTG AACGTGCAGG CCCGTTCCGA CTACTACATC GACTACGCCA ACACCTTCAA ATCCAACGGC TACGCCCTGC TCGGCGCCCG CCTGGGCTAC CAGGCGCCGC AGCGCAAATG GTCGGTGTTC CTGGACGGCA GCAACCTCAC CGACCGCGAA TACGCCTCCA GCACCAACGT CGTCTACGAC GCCCAGGGCA AGGACACGGC GTCCTTCTAT CCCGGCGACG GCATCGGCGT GGTGGCCGGC CTCGACTTCC GTTTCTGA
|
Protein sequence | MNEIARPTSF RDGLFRGSPG KVAMALLLAH GQLYAQEDKN EENPTLGTVV VQHERLTPAE QARTRIENIP GGASVVDSAQ VELGKAATVQ DILAYQPGVF VQSVGGNDAI KVSIRGSGIQ SAPGNMTEGI KFLFDGLALT GPGGTSYELF EPLGLDHTEV LRGANAFDYG AVTLGGAINF VSANGLNAPG TRVHVEGGKY GYRKAFAGTG GTLGDADYHF SVKESRRDGF QRQTFKRAEG LSANLGYRFS PDLESRLILR YNEEFHEQSA PLTRARLHHD PSDTTSAYRV SRSNVDKYGS FLAGLKTTWQ IDDHSLLEVG LAYYKYPQKL NRYSTTPSTS DYRDLNPSIR YIRNDTLFGL PSETIASWYY TRHVKGETDA YQRNADGSLT HTKKSRYKGS YDNVFVLGNT LDLTRDLKLL TGLTAIQVRR DVELAYSAAP INSGYSDRVN YDNWSLAPRL GLSWQLNPTL QVFANASRSI NPVATWGYSP STFSTAVNFV KQLEEQKANT LEVGLRVKTE RLDGSLVLYR SWLRDELLTV ELSPTTETSS ALVTTSNADK TMHQGVEAGL AATLWEHGGH RLSLRQAYTF SDFRFRGDDS FGSNYLPNVP RHIYQAELHY QQAGGFYASL NVQARSDYYI DYANTFKSNG YALLGARLGY QAPQRKWSVF LDGSNLTDRE YASSTNVVYD AQGKDTASFY PGDGIGVVAG LDFRF
|
| |