Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_52120 |
Symbol | |
ID | 7764049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5324279 |
End bp | 5326285 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643808028 |
Product | TonB-dependent receptor |
Protein accession | YP_002802262 |
Protein GI | 226947189 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.618776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCACC CCGCATCTCG CCTGCCCCTT TTGGCGGGCG GCCTCTGTCT TTGCCTGAAC GGTCTGTCCC AGGCCGATGA CAATGTCTTC GAACTCGGCC AGATCTTCGT GCTGGGCAGC CATGGCGACG CCAGCCGCCT CGACAACACG GACTCGGTGG ACAGCCGGGA CATGCGCCTG CACGACCGCG AAACCGTCGG CGAGGCGCTC AACCTGATTC CCGGGGTCAG TCTCAGCAAG ACCGGCGCTC GCAACGAGCA GATGGTCTAC GTGCGTGGCT TCGATCTGCG CCAGGTGCCC GTCTTCATCG ACGGCATCCC GGTGTACGTG CCCTACGACG GCTATCCCGA TCTCGGCCGC TTCACCACCT TCGAACTATC CAGGATCGAG GTCTCCAAGG GCTTCAGCTC CATGACCTAC GGCCCGAACA CCCTGGGCGG GGCGATCAAC CTGGTGACCC GGCGGCCGCA GGAGGAATTC GAAGGCGAGA TCGGCACCGG TTTCAGCTTC ACCGACCGAG CGGAGAACAA CGGCGAATGG ACATACGCCA ACTTCGGCAC CCGGCAGGAC AACTGGTGGG CGCAGATGGG TCTGTCCTAC CTCAACGAGG ATTATTTCCG CCTGCCCGGC AACTTTGACG ACGAACGCCT GGAGGATGGC GGGCGGCGCG GCAACAGCGA CCGCAAGGAC AAGAAGATCA ACTTCAAGGT CGGCTTCACG CCCAACGACA CCGACGAATA CGTGCTGGGC TACGTCAAGC AGGACGGCGA AAAGGGCAAC CCCGCCTACG CCGGACACCT GGGCGGCAGC TACAACCGCT TCTGGCGCTG GCCCAAGTGG GACAAGACCA GCTACTACAT CAACACGGTC ACCGCTTTCG GCGATCACAA GGTCAAGGCC CGCCTCTACC ACGACATCTA CAAGAACGAC CTGTACTCCT ACGACAACGA CAGCTACAGC ACGCAGAAAA CCTCCCGGGC GTTTCGCAGC TACTACGACG ACTACAGCAC CGGTCTCAGC CTCGAGGACG AGTGGACCCT GGACGAAAAC AACCAGTTGC GCCTGGCCTT CCACCACAAG CAGGACGTAC ACCGCGAGCA CGACGCCGGC GAGCCCAAGC AGCGTTTCGA GGACGAAACC CAGTCCCTCG CCCTGGAGTA CACCCGCAAG CTGACCGAAC GTCTCACCCT GATCGCGGGT TTCTCCCATG ACCGGCGCAA CGGCCGCGAA GCCGAGACCT ACACCAGCGC CGCCGGCCTG TTCGAGGAGG AAGGCGGGCG CGAATCCACC AACAACGGCC AGCTCGGGCT GTTCTTCCAG GCCGACCCGC GGACCCTGTG GCGCTTCACC GTGGCGCGCA AGAGCCGCTT CCCGACCATG AAGGATCGCT ACTCCTACCG TTTCGGCAGC GCCCTGCCCA ACCCCGACCT GAAAACCGAG GAAGCCACCC ATTTCGAGAT CGGCTACAAG CGTGCCCTGA GCGACAGCCT GGAACTGGAT CTGGCGCTGT TCCGCAGCCA CGTGGACGAC CTGCTGCAAT CGGTGCGCGT GGCCGGCAGC GCCTGCTCCA ACCCGCCCTG CTCGCAGATG CAGAATGTCA GCGAGGCGCG CATGAACGGC GTGGAAGCCT CGCTGAACGG CAACCTGGGC GCCTGGGAAG TGAACTTCAA CTACCTCTAC CTCAACCGCC AGAACCGCTC CTCCGACGAC TCCATACGGC TCACCGACAC GCCCAAGCAC AAGGGCTTCC TCAGTATCGG CCGGTGGTTC GGCCCATGGC ACCTGATGGC CAGCTCGGAC GCTTCCTCCA GCCGCTACAG CTCCACCGAC GGCACCCAGG AAGCTTCCGG ATTCGTGGTC TTCAACCTGA AGGGCGGCTA TCGCTTCGAC AACGGCCTGC AACTGGACGC CAGCGTGCAG AACCTCACCG ACCACGAGTA CGAATACAGC GAAGGCTATC CCGAACCGGG CCGCACCTTC GTCGTGCAGG CCAACCTGAC TTTCTGA
|
Protein sequence | MQHPASRLPL LAGGLCLCLN GLSQADDNVF ELGQIFVLGS HGDASRLDNT DSVDSRDMRL HDRETVGEAL NLIPGVSLSK TGARNEQMVY VRGFDLRQVP VFIDGIPVYV PYDGYPDLGR FTTFELSRIE VSKGFSSMTY GPNTLGGAIN LVTRRPQEEF EGEIGTGFSF TDRAENNGEW TYANFGTRQD NWWAQMGLSY LNEDYFRLPG NFDDERLEDG GRRGNSDRKD KKINFKVGFT PNDTDEYVLG YVKQDGEKGN PAYAGHLGGS YNRFWRWPKW DKTSYYINTV TAFGDHKVKA RLYHDIYKND LYSYDNDSYS TQKTSRAFRS YYDDYSTGLS LEDEWTLDEN NQLRLAFHHK QDVHREHDAG EPKQRFEDET QSLALEYTRK LTERLTLIAG FSHDRRNGRE AETYTSAAGL FEEEGGREST NNGQLGLFFQ ADPRTLWRFT VARKSRFPTM KDRYSYRFGS ALPNPDLKTE EATHFEIGYK RALSDSLELD LALFRSHVDD LLQSVRVAGS ACSNPPCSQM QNVSEARMNG VEASLNGNLG AWEVNFNYLY LNRQNRSSDD SIRLTDTPKH KGFLSIGRWF GPWHLMASSD ASSSRYSSTD GTQEASGFVV FNLKGGYRFD NGLQLDASVQ NLTDHEYEYS EGYPEPGRTF VVQANLTF
|
| |