Gene Information |
Locus tag | Avin_16170 |
Symbol | gspE |
ID | 7760552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1595326 |
End bp | 1597035 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804517 |
Product | type II secretion system protein E, GspE |
Protein accession | YP_002798807 |
Protein GI | 226943734 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
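
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes one terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1595326
END_BP = 1597035
GENE_LENGTH_BP = 1710
PROTEIN_LENGTH_AA = 569


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1710, matches Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # 569, matches Protein Length
```

Under these assumptions the listed coordinates, gene length, and protein length agree with one another.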
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTCC CGTGGATTGG GGCTTTCCCG CTGGTATTCC GAGGGGTGGC GATGAGATTG GGTGAACGGC TGGTCCGGGC CGGGCTGGTC ACTGAGGATG ATGTGCAGCG CGCCTTGGGG TTGCAGCAGC AGGCGGGTGG CCGGCTCGGC TCCATTTTCA TTCGCATGGG CGCGCTTTCC GAGGATGCCT TGCTGGTTGT GCTGCATGAG CAGTTGGGGT TGCCGGTTCT GGACGGGGCC GGGATGCCGG TCCCGTCCAC TGCCTACGCC TTCATGTCCG GTCTTCCGTT ACGGCTGGGC TGGTTGCTCG ACAACAATGT AGTGCTGTGG GAAGAGGCCG GCAGGGTGCA CGTATTGGCC AAGGACCCGA ACGATTCGCG GATCGGGGAT CTGCTGGGCT ACACCTTCGG TGACCGGGAA CTGGTGCGCT ATCTGGCCCG CGCTCAGGAT GTCGACACGC TCGTCGATCA GGTGCGCAGG GAGAGCGCCG TATCCGATCT GTTTCGCGAT GATCGCCAGA CCCTCAAGAG CCTGATCGAA GAAGCGCCGG TCGTCGAGCT GGTGAACAAT CTCCTGGCGC AGGCGGTCGA TTCGGGAGCT TCCGACATTC ATGTCGAGCC CGAGGAGACC CGTTTTACCG TGCGGATGCG AGTCGACGGC GTTCTGCACA CGCGCATGGT GCAGCCTTAC GAGCGTTATC CGGCGATCGG TTCGCGCATC AAGCTGATCG CCGGGCTGGA TATCGCCGAG AAGCGTTTGC CGCAGGATGG CCGGATTACC TTGCGCCTGA GCGGAAAGGA CATGGATATC CGGGTTTCCA CGGCGCCGGG AGTTTTCGGC GAGTCGATCG TCATGCGGTT GTTGCCGAAG AACCGTGGTG CCCTGTCGCT GGGCAGCCTG GGTTTCGAGT CCGACCATAT CGAGTTGCTC AGGCAGTGGC TGGCATTTCC TAATGGCATT GTGCTAGTAA CTGGGCCCAC GGGGTCCGGC AAGTCCACGA CCCTGTATTC GGCGCTCGAG GAGATGCGCG ATGGCAGCAA CAAGATCATC ACCGTGGAGG ACCCCGTCGA GTATCAGGTA CCGGGCGTGA CGCAGATCCA GGCGCACGCG GAAATCGGCT ACACCTTCGC TCGTGCGCTG CGGGCCATTC TGCGCCAGGA CCCGGACACC ATCATGATCG GTGAGATTCG CGACCTGGAT ACCGCGCAGA TCGCCATCCA GTCCGCTTTG ACAGGGCACC TGGTGTTGTC GACGCTGCAT ACCAATGACG CGGTTTCGGC CTTCACCCGC CTGATCGACA TGGGGTTGGA GCCTTTTCTG GTGGCCGCTT CGGTGCGCGG TGTTCAGGCG CAGCGTCTGG TGCGCAAGCT CTGCGGGCAT TGCGCGGTGC GCGACGAGGC GCCGGCCTTG CCGGCTGCCT GGCAGGCAAT GGGACGTCGC GTCGCGGAGG GCGACTGGAA ACGGGCATGC GGTTGTGCCC ATTGCCACCA GACCGGCTAT CGCGGTCGCA TGGGGATATA CGAGTTGGTA CCCCTGAGCG CTTCCCTGCA GCAGTTGGTC AATCGCCAGG CGCCTCTGCA GGAGATGAAG GCGCTGATCA AGGGTCAGGG GCACCGGGGC CTGCTGGAGG ATGGATTGAT CAAGGCCAGC AAGGGTCTGA CCAGTATCGA GGAGGTCATG CGTGTTGCTT GTGTCGAACA GGAATTCTGA
|
Protein sequence | MGVPWIGAFP LVFRGVAMRL GERLVRAGLV TEDDVQRALG LQQQAGGRLG SIFIRMGALS EDALLVVLHE QLGLPVLDGA GMPVPSTAYA FMSGLPLRLG WLLDNNVVLW EEAGRVHVLA KDPNDSRIGD LLGYTFGDRE LVRYLARAQD VDTLVDQVRR ESAVSDLFRD DRQTLKSLIE EAPVVELVNN LLAQAVDSGA SDIHVEPEET RFTVRMRVDG VLHTRMVQPY ERYPAIGSRI KLIAGLDIAE KRLPQDGRIT LRLSGKDMDI RVSTAPGVFG ESIVMRLLPK NRGALSLGSL GFESDHIELL RQWLAFPNGI VLVTGPTGSG KSTTLYSALE EMRDGSNKII TVEDPVEYQV PGVTQIQAHA EIGYTFARAL RAILRQDPDT IMIGEIRDLD TAQIAIQSAL TGHLVLSTLH TNDAVSAFTR LIDMGLEPFL VAASVRGVQA QRLVRKLCGH CAVRDEAPAL PAAWQAMGRR VAEGDWKRAC GCAHCHQTGY RGRMGIYELV PLSASLQQLV NRQAPLQEMK ALIKGQGHRG LLEDGLIKAS KGLTSIEEVM RVACVEQEF
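
The gene and protein sequences above can be spot-checked against each other by translating the first and last codons. This is a minimal sketch rather than the database's own pipeline: it assumes the spaces in the listing are display separators only, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The names `clean`, `translate`, `GENE_HEAD`, and `GENE_TAIL` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 nucleotides of the gene sequence, copied from the record.
GENE_HEAD = "ATGGGCGTCC CGTGGATTGG GGCTTTCCCG"
GENE_TAIL = "GTCGAACA GGAATTCTGA"

if __name__ == "__main__":
    print(translate(GENE_HEAD))  # MGVPWIGAFP
    print(translate(GENE_TAIL))  # VEQEF*
```

The two fragments translate to the first ten and last five residues of the listed protein, followed by the TGA stop codon.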