Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_29150 |
Symbol | |
ID | 7761818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3006037 |
End bp | 3007743 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643805791 |
Product | type II secretion system protein E |
Protein accession | YP_002800059 |
Protein GI | 226944986 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0342357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAATC TTTCTCCGGA ACATGCGGCC GACGGGGCGC CGCGGGATGA GGGCCGTTTC GCGGCGGAGC TGCTGTGCGA GGCCCGCGAG CGCGCGGCGC GCGACGGTGG GCGGGCGCTG GAGGCGCTGG AGACGCTGGC CGGGCTGGCG CCGGAGGCGT TCGTCCGCCG TCTGGGGGCG ACCCTGCATT ATCCGGTGCT GGTCGGCGAG ACCCTGTTCG CCTGCCGTCC GGCATTCGAG CGGCTGGCGC TGGCCCAGGC GCTCAAGCGC GAATTCGCCG TGGTGCGCCT GGACGAGCGC GAGGTCGGGG TGTTCGCCGA TCCCTTCGAC GACGACCGCC TGGCCTGGAT CGACGCCTGC CTGGAGGGCG CGCCGCTGTA TCTGGTGCAT CCGGCCGACC TGGCCGGCTA CCTGGCGCGC CACGAGGAGG CCTTCCAGGC CGTCGAGTCG CTGGGCGCCG ACGCCGCCGG GACGGTCGAG GCCGAGGCGG TGGAGAATCT GTCGCTGGCG CGGATCAGCG AGGACTCCAG CGTGGTGGTC AAGCTGGTCA ACTCGACCCT CTACGACGCG CTGAAGTTGC ACGCCAGCGA CATCCACCTG GGCATGACCG GCAACGGCCT GGTGATCAAG TACCGCATCG ACGGGGTGCT CAACTCCATC GGCCGGGTGC AGGGCAGCGA GACCGCCGAC CAGGTGATCT CGCGGATCAA GGTGATGGCC GAGCTGGACA TCGGCGAGAA GCGGGTGCCC CAGGACGGTC GCTTCAAGGT GGGTATCCAC GGCCGGCAGA TCGACTTCCG GGTGTCGATC ATGCCGAGCA TCTTCGGCGA GGACGCGGTA CTGCGGGTGC TCGACAAGCA GGACCTGGCC GACCGCGTCA GCGGCGTCAG CCTGGAGGCG CTGGGCTTCG AGGAAGCGAC CCTGCGCAAT CTGCGCCGGC TGGCCGCCGA ACCCTACGGC ATGGTGCTGG TGACCGGCCC CACCGGCAGC GGCAAGACCA CCACCCTGTA CGCGATGATC ACCGAGATCA ATCACGGGGT GGACAAGATC ATCACCATCG AGGACCCGGT GGAATACCAG TTGCCGGGGG TGCTGCAGAT CCCGGTCAAC GAGAAGAAGG GCCTGACCTT CGCCCGCGGC CTGCGCTCGA TCCTGCGCCA CGACCCGGAC AAGATCATGG TCGGTGAGAT CCGCGACCCG GAAACCGCGC AGATCGCCGT GCAGTCGGCG CTCACCGGCC ACCTGGTGTT CACCACCATC CACGCCAACA ACGTGTTCGA CGTGATCGGC CGCTTCGTGC AGATGGAGGT CGATCCCTAC AGCTTCGTCT CCGCGCTCAA CGCGGTGCTC GCCCAGCGCC TGGTGCGCCT GGTCTGCCCG GACTGCGCCG CGCCGGTCGA GCCGGGCGAG GAGGAACTGC TGACTTCCGG GCTGGACCCG GCCGGAGTCG GCGATTTCGG TTTCGTCCAC GGCAGCGGCT GCGGGCGCTG CCGTGGCTCG GGCTATCGCG GGCGCAAGGC GATCGCCGAG TTGCTGCTGC TCGACGACGA GATCCGCCAG ATGATCGTCG AACGCCAGCC GATCACCCGG GTCAAGGAAC TGGCCCGCCG CCGTGGTCTG CGCCTGCTGC GCGATTCGGC GCTGGAACTG GTGCGCAGCG GGCGAACCAG CCTGGAGGAG ATCAACCGTG TCACTTTCGC GGGCTGA
|
Protein sequence | MDNLSPEHAA DGAPRDEGRF AAELLCEARE RAARDGGRAL EALETLAGLA PEAFVRRLGA TLHYPVLVGE TLFACRPAFE RLALAQALKR EFAVVRLDER EVGVFADPFD DDRLAWIDAC LEGAPLYLVH PADLAGYLAR HEEAFQAVES LGADAAGTVE AEAVENLSLA RISEDSSVVV KLVNSTLYDA LKLHASDIHL GMTGNGLVIK YRIDGVLNSI GRVQGSETAD QVISRIKVMA ELDIGEKRVP QDGRFKVGIH GRQIDFRVSI MPSIFGEDAV LRVLDKQDLA DRVSGVSLEA LGFEEATLRN LRRLAAEPYG MVLVTGPTGS GKTTTLYAMI TEINHGVDKI ITIEDPVEYQ LPGVLQIPVN EKKGLTFARG LRSILRHDPD KIMVGEIRDP ETAQIAVQSA LTGHLVFTTI HANNVFDVIG RFVQMEVDPY SFVSALNAVL AQRLVRLVCP DCAAPVEPGE EELLTSGLDP AGVGDFGFVH GSGCGRCRGS GYRGRKAIAE LLLLDDEIRQ MIVERQPITR VKELARRRGL RLLRDSALEL VRSGRTSLEE INRVTFAG
|
| |