Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_52070 |
Symbol | |
ID | 7764044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5315143 |
End bp | 5317875 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643808023 |
Product | hypothetical protein |
Protein accession | YP_002802257 |
Protein GI | 226947184 |
COG category | [S] Function unknown |
COG ID | [COG4458] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTACC CTGACCAATC CCCCGAACGC CTGTCCCAGA GCTGGTCCGC CGTCTACGGC GGGGCGGGCG AGGCGATCCG CTGGATCGAC GAGGTGCGCC GCAGCGCGCC GCGCCTGGAC AACGAGGCCG ACGACCTGAT CCTCAGCCTG CGCCGCGTGC GCAACACCGC GCGGCGTCTC GGCGCGGTTT CCGCGCTGCC GATGACCGTG GGCTTCTTCG GCCTGTCCCA GGCCGGCAAG TCCTACCTGA TCGCCACCCT CGCCGCCGGC GCCAACGGCA AGCTGGAAAC CGACCTGGGC GGCGACCACC TGGACTTCCT TGTCCACGTC AACCCGCCGG GCAGCGGCAA GGAGGCCACC GGGCTGGTCA CCCGTTTCAG CCGTACCGCC CGGCCGGGGC CGGCGGAGTT CCCGCTGGAA CTCAAGCTGT TCGCCGAGAT CGAGCTGGCC AAGGTGCTCG CCAACTCCTT CTTCAATGAC TTCAACACCG AGAAGGTGCG CTACCGCTTC GAGGAGGCGG CGATCCGCCA ATTGCTCCGC GAGCTGGAGC GGCGCCGCCA GCCCTATCCG GTGCCGGGCG TCGACGCCGA TGCCGTGGTA GACCTGTGGG ACTACCTGCA GGGCAGCTTT CCGGGTTCGC TGGGCGCGCT GGCGGCCTAC TACTGGCCGG CCGCCGTCGA GCTGGCGCCG CGCCTGGCGC TGGAAGACCG CGCCCGGCTG TTCTCGATCT TCTGGGGCGA GATCCCCGAA CTGAGCGACA CCTACCTGGC CTTCGCCCGT ACCCTGACCA GCCTGGGCCA CGCCGAACGG GTCTTCGCGC CGCTGGGCGC CCTGGTGCGG CCGACGCCGC AGGGCGGGCT GTCGCAGGCC GACAGCATCA TGAACGTGGA CATGCTCGAA CGCCTCGGCC GGGACAACGA CCTGCGCATC GCCGTGCGTC CGTGGAAGGA CGGCGCCCTG GGCGCCGAGG TCGAGCTGAC CCTGGCGCAA CTGGCGGCGC TCACCGCCGA GCTGGTCTTC CCGCTGGTCG AGCCGACCAG TGAACCGCTC TGCGAAGAGG TCGACCTGCT CGACTTCCCC GGCTACCGCG GCCGGCTCGG CGTCGAGTCG CTGGACGAGG TGCGCCGCGC GGTGAACAAC GAGGACAGCA ACCCGGTGGC GCAGCTCCTC CTGCGCGGCA AGGTCGCCTA TCTCTTCGAG CGTTACACCG ACAGCCAGGA AATGAACGTG CTGATCGTCT GCACGCCGTC GAACAAGCAG TCGGACGTCA CCAGCGTCGG CCCGGTGCTG AGCCGCTGGA TCGACCGCAC CCAGGGCGAG ACGCCGGCCG AGCGGGCGCG CCGCAAGCCG GGCCTGTTGT GGGCGATCAC CATGTTCGAC ATGCGCATCG GCAGCGACCT GGACAAGGGC GAGGACCTGC TGCGCCTGGG CTGGGGCAGC GGCGGCATGA TGAAGATGAC CATGCTGGAA CGCTTCGGCC AGTACGCCTG GCTGCAGGAC TGGGCCGACG GCCGGCCGTT CGACAATACC TTCCTGGTGC GCAAGCCGCG CATGAAGGTG ACCTTCCTCG ACCTCGACGG CGGCCGGGAA ACGGCCATCC ACGAGGCCGC TCGCGAGTCG CTGGCGCTGA TGCGCCAGAC CTTCTGCGAG GACGAGACGG TCCGCCGCCA CGTCCGCGAC CCCGAGGAAG CCTGGGACGC CATGCTGGCG CTGAACGACG GCGGCATGGG CCGCATCGGC CGCTACCTGC GCCAGGTCGC CCTGCGCGAG GTCAAGCAGG GGCGCATCCG CGAGCAACTG GATGCCATCC TGCACCTGAT CGAGAACCGC CTCGGCCACT GGTTCCAGGC CGAGGGCGCC GGCGAGCTGG AGCGCAAGCG GCGCATCGCC CGGCAGATCT CCGACGCCCT GTGGCCGCGG CGCCTGCTGC TCGGCGAGCT GCTGCAGCGC ATGCAGCTCC CGGACGACCT GCTGCGCGCG CTCTACCTGC GCGCCGGCGA GGAAGACGAA GCGCCGCCGC CGGCCGCCGG CGAACGCCTG CTGCCGGGCG CCGCACCGCT GGCCGGGGCG CTCAATCTCG GCCTCGACCT GAACGGCGAC GACAGTTTCG ATCCCTTCGG CGACAATGGC CCTTCCGGCG CGGTGCCCGC GGCGCCGGCG CGGCAGGCGC GCGGTAGCGA CGCGCGCTTC GCCCAGGCGG TGCTGCGCGA GTGGATCGGC CACCTGCGCC ACCTGCCCGA GGACGTGCGC CTGATGACCT ACCTGGGCTT TGCCAAGCCG GCGGTCGAGG CTCTGATCGA CGAACTGGTC ACCGGCGCCA GCCGCCTCGG CCTGGAGCAG CGCCTGTTGC AGGCGATCGT CAGTACCGAG CAGGTCGGTA CCAAACGCGA GCAGCTCGCC GGCCGCCAGG TGCTCACCGC CAAGACGGTG CTCGGCGACT TCATCGGCTG GCTGGGCTTC ATCGAGCAGT CCCTCGAACA ACGCCCGGAC AGCCGCATCG AGCGCGACGC CAGGCTGTTC CAGCCGGCGC CGCCGATCGC CCCGGGCCAG TTGCCGCGCC TGACGGAGCA ACCCCAGGAC TACACCCGGC GCTACGTCGG CGACTGGCTG GTGGCCCTGG CGCGGATCGC CGAGGAAAAT GCCGGACACA GCGCCGGCCG GGAAATCAGC CTCGAAGAGA ACGAAGCGCT GGGCCGCATC CTCGCCACCT TCCAATCGGC ACGGGCAGAC TGA
|
Protein sequence | MTYPDQSPER LSQSWSAVYG GAGEAIRWID EVRRSAPRLD NEADDLILSL RRVRNTARRL GAVSALPMTV GFFGLSQAGK SYLIATLAAG ANGKLETDLG GDHLDFLVHV NPPGSGKEAT GLVTRFSRTA RPGPAEFPLE LKLFAEIELA KVLANSFFND FNTEKVRYRF EEAAIRQLLR ELERRRQPYP VPGVDADAVV DLWDYLQGSF PGSLGALAAY YWPAAVELAP RLALEDRARL FSIFWGEIPE LSDTYLAFAR TLTSLGHAER VFAPLGALVR PTPQGGLSQA DSIMNVDMLE RLGRDNDLRI AVRPWKDGAL GAEVELTLAQ LAALTAELVF PLVEPTSEPL CEEVDLLDFP GYRGRLGVES LDEVRRAVNN EDSNPVAQLL LRGKVAYLFE RYTDSQEMNV LIVCTPSNKQ SDVTSVGPVL SRWIDRTQGE TPAERARRKP GLLWAITMFD MRIGSDLDKG EDLLRLGWGS GGMMKMTMLE RFGQYAWLQD WADGRPFDNT FLVRKPRMKV TFLDLDGGRE TAIHEAARES LALMRQTFCE DETVRRHVRD PEEAWDAMLA LNDGGMGRIG RYLRQVALRE VKQGRIREQL DAILHLIENR LGHWFQAEGA GELERKRRIA RQISDALWPR RLLLGELLQR MQLPDDLLRA LYLRAGEEDE APPPAAGERL LPGAAPLAGA LNLGLDLNGD DSFDPFGDNG PSGAVPAAPA RQARGSDARF AQAVLREWIG HLRHLPEDVR LMTYLGFAKP AVEALIDELV TGASRLGLEQ RLLQAIVSTE QVGTKREQLA GRQVLTAKTV LGDFIGWLGF IEQSLEQRPD SRIERDARLF QPAPPIAPGQ LPRLTEQPQD YTRRYVGDWL VALARIAEEN AGHSAGREIS LEENEALGRI LATFQSARAD
|
| |