Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_40090 |
Symbol | |
ID | 7762896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4063178 |
End bp | 4065358 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643806869 |
Product | outer membrane copper transport protein |
Protein accession | YP_002801121 |
Protein GI | 226946048 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01778] TonB-dependent copper receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCG ACAAGTTTCG CCGTGCGGCG CGCCCGCACC TGCCCAAGAC CTTTTCCCGC CTCCTGCCGC ACTGGCGCCG GGCACCGGCG CATGCCGCGC TGCTCGGCCT ACTGGCCAGC GGCAGCCTGC CGGCCGCCGA GCCGGAAGAC GACAGCGACT CCCTGCAAAT GTCGCCGCTG GTGATCACCG GCGTGGCCCT GGATGCGCCG CTGACCGTGG TGACCGACCC GCGCATCCCC CGCCAGCCGG TGCCGGCCAG CGACGGCGCC GACTACCTGA AGACCATTCC CGGCTTTTCC GCGATCCGCA GCGGCGGGGT GAACGGCGAC CCGGTGTTCC GCGGCATGTT CGGCTCGCGG CTCAAGCTGT TGACCAACGG CGGCGAGATG ATCGGCGCCT GCCCGGCGCG GATGGATTCG CCCAGTTCCT ACATCGCGCC GGAAACCTTC GACGAACTGA CCGTCATCAA GGGCCCGCAG AGTGTGCTGC ATGGCCCCGG CGCCTCGGCC GCCACCGTCC TCTTCGAACG CAACCCGGAA CGTTTCGACG CGCCGGGCGG GCGTGTCGAC ATGAGCTTCC TGGCCGGCTC CAACGGTCGC TTCGACCGCC GCATCGATGC TGCCGCCGGC GCCGAGCAGG GTTATTTCCG CCTGCTCGCC AACCGCTCCG ACGCGGACGA CTACGAGGAT GGCGACGGCG ACACCGTGCC CTCGCGCTGG GACAAGTGGA CCAGCGAATT CGTCGCCGGC TGGACGCCGA CCGCCGACAG CCTGCTGGAA CTCACCTACG GCATCGGCGA CGGCGAGTCC CGCTACGCCG GGCGCGGCAT GGACGGCACC CGGTTCGACC GCGAGAGCCT GGGGCTGCGC TTCGAGCTGG AGAACATCGG CGAGGTTTTC CAGAAGGTCG AGGCGCGGCT CTACTACAAC TACGCCGACC ACGTGATGGA CAACTTCCGC CTGCGCAGCC CGGATGCGGA TTCCAGCATG CCGATGCCGA TGGCCAGCAA CGTCGACCGC CGCACCCTGG GCGGACGCCT GACCGGCACC TGGCGCTGGG AGCACTACCA ACTGGTCGCC GGCCTCGACT ACCAGACCAG CGAGCACCGC GAGCGCTCCT CCACCTACAC GAGTGCGAGC CACGGCATGT CCATGGGTCA TCACGTCATG ACCAGCGCCT CGTTCGTCGA CGCCGACGAC TATCCCTGGA ACAAGGACGC CGATTTCCAC AACTTCGGCC TGTTCGGCGA ACTGACCCGG CACCTGGGCG ACGCCTCGCG GCTCGTCCTC GGCGCGCGGC TGGATCGCAG TTACGCCGAG GATCACCGCG ATACGCTGAG CGGCAGCATG GGCATGTCGA GCTGGAACAA TCCCACCGCC GGCAAGGAGC GCGGCGACAC CCTGCCCAGC GGCTTCGTCC GCTACGAACA CGGCCTCGCC GGCATTCCGG CGACGGCCTA TGTCGGCCTG GGGCACACCC AGCGTTTCCC GGACTACTGG GAACTGTTTT CCTCCACCAA CAGCATGGGC CGGACCAGCG CCTTCGATTC CATCCACCCG GAGAAGACCA CCCAGCTCGA CTTCGGCATC CAGTACGCGC GGGGGCCGCT GGAAGCCTGG GCCTCCGGCT ACCTCGGCTG GGTGCGGGAT TTCATCCTGT TCGACTACGC CAGCGGCATG TCGGGCATGG GTTCGTCCTC GGCGCGCAAC ATCGACGCAC GCATCTTCGG CGGCGAGGCG GGTCTGTCCT ACCGCTTGAG CCAGCATTGG AAGACCGACG CCAGCCTGGC CTACGCCTGG GGCAAGAACA GCTCGGACGG CCGCGCGCTG CCGCAGATCC CGCCGCTGGA AGGGCGTTTC AGCCTGACCT ACGAGCAGGG CGACTGGAGC GCCTCCGGCC TGTGGCGGCT GGTGGCCAGG CAGACGCGGG TGGCGGAAGG CCAGGGCAAC GTGGTGGGGC AGGATTTCGG CAAGAGCGCC GGCTTCGGCG TGCTCTCCTT CAACGGCGCG CACCGCTTCA ACCGGCACCT CAAGCTCAGC GCCGGCGTCG ACAACCTGCT GGACAAGCGC TACAGCGAAC ACCTCAACCT GGCGGGCAAC GCCGGCTTCG ACTATCCGGG CGACACGCGG ATCAACGAGC CGGGGCGGAC CTTGTGGGCG CGGGTGGACC TGAGCTTCTG A
|
Protein sequence | MNRDKFRRAA RPHLPKTFSR LLPHWRRAPA HAALLGLLAS GSLPAAEPED DSDSLQMSPL VITGVALDAP LTVVTDPRIP RQPVPASDGA DYLKTIPGFS AIRSGGVNGD PVFRGMFGSR LKLLTNGGEM IGACPARMDS PSSYIAPETF DELTVIKGPQ SVLHGPGASA ATVLFERNPE RFDAPGGRVD MSFLAGSNGR FDRRIDAAAG AEQGYFRLLA NRSDADDYED GDGDTVPSRW DKWTSEFVAG WTPTADSLLE LTYGIGDGES RYAGRGMDGT RFDRESLGLR FELENIGEVF QKVEARLYYN YADHVMDNFR LRSPDADSSM PMPMASNVDR RTLGGRLTGT WRWEHYQLVA GLDYQTSEHR ERSSTYTSAS HGMSMGHHVM TSASFVDADD YPWNKDADFH NFGLFGELTR HLGDASRLVL GARLDRSYAE DHRDTLSGSM GMSSWNNPTA GKERGDTLPS GFVRYEHGLA GIPATAYVGL GHTQRFPDYW ELFSSTNSMG RTSAFDSIHP EKTTQLDFGI QYARGPLEAW ASGYLGWVRD FILFDYASGM SGMGSSSARN IDARIFGGEA GLSYRLSQHW KTDASLAYAW GKNSSDGRAL PQIPPLEGRF SLTYEQGDWS ASGLWRLVAR QTRVAEGQGN VVGQDFGKSA GFGVLSFNGA HRFNRHLKLS AGVDNLLDKR YSEHLNLAGN AGFDYPGDTR INEPGRTLWA RVDLSF
|
| |