Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11850 |
Symbol | pilW |
ID | 7760127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1135783 |
End bp | 1136970 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643804087 |
Product | type IV pilus assembly hypothetical protein, PilW-like protein |
Protein accession | YP_002798389 |
Protein GI | 226943316 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4966] Tfp pilus assembly protein PilW |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.983551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGC GTTTCAATGG CGGACGGCGG TCCGCCCGCA TATTCATTCA TTGCGCCCAG ACCGGTTTCA GTCTGATCGA ATTGATGGTC GCCTCGACCA TCGGACTCTT GATCATGACG GCGGTCTTGA CGCTGTTTCT CAATGTGAGT CGCACCAATG ATGAAATGGC CAAGACCAAT ATGCTGATCG AGAATGGACG ATTCGCGATT CAGTTATTGC AGAACGATAT CGCCCATGCC GGTTTCTGGG ATAGTTATAT TCCGGACTTC GACGATCTGA CAGTGACTGC GGTACCGGCC GAGATTCCGG CTGTCGTTCC CGATCCTTGT CTGGCGTATT CTTCATGGAC AGCAGCGACT CGTACCAGCC AGTTGGGAGT TCCGCTGCAG GTCCATGATT CGCCACCTTC CGAATGCGGG GCCGTGATAA CCAATCGCAG AAGTGGTACC GATATTCTGG TGGTGCGCCA CTTGAATACC TGCGTTGCTG GCGGCTGTGA AGCGGAAGAG AGCGGGAGAC TCTATTTTCA GGCATCCCGT GCCCGTGGAG GCGACTGTCC CGCCAGCGTT TCTTCGGAGG CTCCATATAT ATTTTCCACC GATCCTGCCG ATTTCGTCCT GCATGATCGC GACTGTACGA CTATCGCTTC CAGGCGAAGA TTCCTTTCCC ATATCTACTA TATCCGCGAC TACGCGGTGA CGCTGGGAGA TGGTGTTCCG ACCCTGGTGC GTTCGGAACT CGATCTGGAG GCTGGAGAGA TCAAGGCCAG ATCCGCCGTT GTACTGATCG AGGGTATAGA AGGATTCAGG GTCGAGTTGG GCGTGGATCG GATCAGCGAT TCCGGCAGGG ATATCATCGT CGAAAGTTCC GACGCAGATC CCTACAGGGA GGCGGTCGAG TGGGCCGATA GGAAAAACCT GACCTCTCCC GTGAATCGTG GAGATGGCGT ACCGGACGAG TTCGTTCATT GTTCGGGAGT TTGTTCGCTC GACAGGTTGA TCAATGTGGT CGCGGTGAAG CTCCATCTGC TGGTGCGTGC TCAGACCGGC ACTCCCGGAT ATACGGACGG CAAGTCTTAT ACCTTGGGGG AGCAGGCGGT CGCGGCCGCC AACGATGGTT TCAAGCGTCA TGTATTTTCC ACGGTCGTCC GGCTGAACAA TGTTTCGGGA AGAAGGGAAA CGCCATGA
|
Protein sequence | MNKRFNGGRR SARIFIHCAQ TGFSLIELMV ASTIGLLIMT AVLTLFLNVS RTNDEMAKTN MLIENGRFAI QLLQNDIAHA GFWDSYIPDF DDLTVTAVPA EIPAVVPDPC LAYSSWTAAT RTSQLGVPLQ VHDSPPSECG AVITNRRSGT DILVVRHLNT CVAGGCEAEE SGRLYFQASR ARGGDCPASV SSEAPYIFST DPADFVLHDR DCTTIASRRR FLSHIYYIRD YAVTLGDGVP TLVRSELDLE AGEIKARSAV VLIEGIEGFR VELGVDRISD SGRDIIVESS DADPYREAVE WADRKNLTSP VNRGDGVPDE FVHCSGVCSL DRLINVVAVK LHLLVRAQTG TPGYTDGKSY TLGEQAVAAA NDGFKRHVFS TVVRLNNVSG RRETP
|
| |