Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11870 |
Symbol | pilY1 |
ID | 7760129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1137512 |
End bp | 1140622 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804089 |
Product | type IV pilus assembly protein, PilY1-like protein |
Protein accession | YP_002798391 |
Protein GI | 226943318 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTGGCCG CTACCCTGCT GGCGATCGGA GCTGGCGCGG ATGCGGAGGA TATCGACCTG TTCGTCGGCG TGGAATCGGA ACGGGGTACG GATTACCCGA ATGTCGTCAT CCTGCTCGAT AACACGGCCA ACTGGGCGGA CGACGCTCAA TATTTCCCGG ATATGAAGCA AGGGCTGGCC GAACTGAGCG CCATCAGGAC GGTGATCGGT GCGCTGGCGC CGGAAGGGGA GGACGCCAAG CTCAACGTCG GGCTCATGAT GCTCGACGAG GGGACCGGAC AGCGCTTCGA CGGCGGTTAT GTGCGCTCGC ACGTCAAGCA GATGACCAAG GGGGCCAGGG CCGAGCTTCT GGCCGAAATT GCCGAGATCG AGTCCGAATT CAGTACGGGC GGGCAGGTAA CGACGGAGGT GGGCTATAGC CGGGCGATGT TCGAGGTATT CAAATACTTC GGCGGTCATA CCAGTCCGGC CAATGCCTAC GATGACACGG CCGGTTCGCC AGTCGGTCCG ACCCACTTCG GGCCGTGGCG TTATTCGGGT GACGATGTCT TCGACAGGCG GGATGCGGCC GCCTTCTCCG ATAGCCGGCA AACCCTCTAC AAGCCCGTCG TACCGGCCAC GGAAACCTGC AAGTCGAAAA ACTACCTCAT CCTGATCGGC AACGGCTGGT CGCCGAATGG CGAGGACGAA GTCGCCACCC TGCTGAAGAA CGTGGGGGGA GATGCCGGTC AGCTCTATCA AACCACCTCG TCCAAGGTGC GTTATGGCGA CGAGATGGCC CGCTTTCTAT ATCAGACCGA CGTCAGCGCG GCGCCCGGCA AGCAGAACGT GCTTACCTAC GCCATCGACG TCTACAACAG GCAAGCCAGC GAGGATCACA GCAGGCTGCT GAAGAGCATG GCGCATGTCG GGGGCGGGAA ATATTTCTCG GCGACCAATG CCGATGAAAT CGAGACGGCA CTGAGCACCA TCTTTTCCGA GATTCAGGCG GTCGACAGCG TATTCGCCTC GGTCAGCCTG CCGGCCAGCG TCAACACCCG GGGCACCTAC CGCAATCAAG TCTATGTCGG GATGTTCCGC CCGGATGCGA ATGGCCTGCC TCGCTGGGCG GGTAATCTCA AGCAGTACAA GCTCGGCCTG GTCGACAATG CGCTCAAGTT GCTGGATGCC GAAGGCGAGC AGGCGATCGA CAGTTCGACC GGCTTCATCG GCGAATGCGC CCGCAGCTTC TGGGGCCCTG CATCGGTCAC GCAGCCCGCC TACTGGGGTA ATGTCAATGG CGCTCCGGCC GGCGGCTGTC TTGCCGTCGC CGACTCGGTC CATGCGGACT ATCCGGATGG CAACGTGGTG GAGAAGGGGG CCCAGAGCTA CAGGCTTCGT ACCCTGAGCG GCAGTGACGC ACGCAACATC AAGAGCTGTG CGACGACGGC CTGTACCGAA CTGATCGACT TCCATGCTCT GACGGTCGGT TCGATCGCTG CCGACGAGAT CAACTGGGGG CGTGGCGGCA ACGGTGAAAC GGAGTTTGTC GATCCGACCG CTATCAGCGC GACGACCGTA CGCCCATCGG TGCATGCCGA TGTGCTCCAT TCCCGTCCCA TGGCCATCAA TTTCGGCACC GATAGCGATG CCGAGGTGGT GGTTTTCTAT GGGGGGAACG ATGGCATCCT GCGCGCGGTG AACGGCAACC GCGACTCCGG TGCCTCCATC GGCGGCATGG CGCCTGGCGG CGAGCTGTGG TCCTTCATGG CCCCGGAATT CCATCCCCGG ATCGCACGGC TGCGCGACAA CATCGTCGGC ATCGCTTACA AGGATCGGAC AGCGGTGGCG TCCGGCGCGG TACCCGAGCC CGAGCCGAAA CCCTATGGCT TCGACGGCCC GATCACGGCC CATCGTTACA GCGGAGGTGC GTGGATCTAC GCCGGCATGC GTCGCGGAGG GCGTGCCCTG TATGCTTTCC AGGTGTCCGA CACTGCCTTG GCCAAGCCAG TGTTCAAATG GCGGATTGGT TGTGACAGCG ACATGAGCGG TACGGACTGC ACCGATGGTT TCGAGCGTCT GGGACAGACC TGGTCGTCGG CCAGGCCGTT CCAGACGGCG GGCTACGATT CCGGCAAGTC TCCGCTGCTG ATCATGGGGG CTGGCTACGA TACCTGCGAG GACAAGACCG ACGACCTCGC CCATAACCAT CTTTGCGGCG ACAACCCGCT GGGCAACCGG ATCTACGTGA TGGATGCCGA CGACGGGGAA CCGATCGCCG AGTTCAAGAC CGAGCGCAGC GTGGTCGGTG ATGTCACCAT AGTGCCCGAC GAGAGCGGCT TGGCCATTTA CGCCTATGTC GCGGATACCG GCGGGACGGT TTACCGGATC GCTTTCGACG CGGCCGGGCC CGGCGACTGG AGCATGGTCC GGATCGCTTC GCTGGGCTGC GATGGACGTT CATCCTGCGA TGCCAACCGG AAGTTCATGT TCGCGCCGGA CGTGGTGATG CTGGGCGATA CCCATTACGT GCTGCTGGGC TCGGGAGATC GCGAGAAGCC GCTGGCCAGC TACGCGGCGG CCACCAGCGT CGCCAACCAT TTCTTCATGC TCAAGGACAA ACCGACGGAT ACGGCCTGGC TGACCGACGA GAACGCGAAG GGTGTTTGCG CTGGCGATTA TCTATGCATG GACTCGCTAT ATCCCATTAC GACCCCGGAT ACGCCTAGCG AAGCGCTTCT GGCACGGAAG AAGGGCTGGT ATCTGGCGCT CGCCGATAGC GAGCAGACGG TGACTTCGGC CATCACGGTC CATGGCACCG TGACCTTCAG TACGCACACG CCGGCGGTCC ATGGCCCCGG CCAGTGCTCC AGACTGGGAA CCGCCAGGGT GTACAACATC GACTACCGCA ATGCCGAGAG CGGGAACGGC GGCGCCATGC GTTACGAAGT CATCGTGGGC GGCGGTTTGC CGCCGTCCCC GGTAGCCGGC ATGGTGACGC TCGATGACGG TTCGAGCGTG CCTTTCATCA TCGGCTCCGA TCCCGACTCC CCGCTGGAAG GCGGATCTCC CAGGGCGGCT TCAGGTGTCG TCCAGCCCAA GGCCAAGGTC TATTGGAATA TCGAGCGATG A
|
Protein sequence | MLAATLLAIG AGADAEDIDL FVGVESERGT DYPNVVILLD NTANWADDAQ YFPDMKQGLA ELSAIRTVIG ALAPEGEDAK LNVGLMMLDE GTGQRFDGGY VRSHVKQMTK GARAELLAEI AEIESEFSTG GQVTTEVGYS RAMFEVFKYF GGHTSPANAY DDTAGSPVGP THFGPWRYSG DDVFDRRDAA AFSDSRQTLY KPVVPATETC KSKNYLILIG NGWSPNGEDE VATLLKNVGG DAGQLYQTTS SKVRYGDEMA RFLYQTDVSA APGKQNVLTY AIDVYNRQAS EDHSRLLKSM AHVGGGKYFS ATNADEIETA LSTIFSEIQA VDSVFASVSL PASVNTRGTY RNQVYVGMFR PDANGLPRWA GNLKQYKLGL VDNALKLLDA EGEQAIDSST GFIGECARSF WGPASVTQPA YWGNVNGAPA GGCLAVADSV HADYPDGNVV EKGAQSYRLR TLSGSDARNI KSCATTACTE LIDFHALTVG SIAADEINWG RGGNGETEFV DPTAISATTV RPSVHADVLH SRPMAINFGT DSDAEVVVFY GGNDGILRAV NGNRDSGASI GGMAPGGELW SFMAPEFHPR IARLRDNIVG IAYKDRTAVA SGAVPEPEPK PYGFDGPITA HRYSGGAWIY AGMRRGGRAL YAFQVSDTAL AKPVFKWRIG CDSDMSGTDC TDGFERLGQT WSSARPFQTA GYDSGKSPLL IMGAGYDTCE DKTDDLAHNH LCGDNPLGNR IYVMDADDGE PIAEFKTERS VVGDVTIVPD ESGLAIYAYV ADTGGTVYRI AFDAAGPGDW SMVRIASLGC DGRSSCDANR KFMFAPDVVM LGDTHYVLLG SGDREKPLAS YAAATSVANH FFMLKDKPTD TAWLTDENAK GVCAGDYLCM DSLYPITTPD TPSEALLARK KGWYLALADS EQTVTSAITV HGTVTFSTHT PAVHGPGQCS RLGTARVYNI DYRNAESGNG GAMRYEVIVG GGLPPSPVAG MVTLDDGSSV PFIIGSDPDS PLEGGSPRAA SGVVQPKAKV YWNIER
|
| |