Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_34230 |
Symbol | |
ID | 7762318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3494663 |
End bp | 3497503 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806285 |
Product | hypothetical protein |
Protein accession | YP_002800547 |
Protein GI | 226945474 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGGG TTCGCAAACT GGCACTGGCA ATCGCGGCGG CCTCGGTGCT GTCTTCGGGC ATGGCCCATG CACTGGGCCT TGGAGAGGTG ACCCTGCGTT CCGCGCTGAA TCAGCCTCTG GTTGCTGAAA TCGAACTGCT GGAGGCGCGC GATGTCGGCG CGGAAGAAAT CGCCCCGGCC CTTGCTTCTC CCGATGCGTT CGATAAGGCG GGCGTCGATC GACAGCACTT TCTGAACGAC CTGAAATTCA CGCCAATAAT CGGACCCAAC GGCAAGCGTG TCATTCGGGT GACCTCCACC AAGCCGGTGC GCGAGCCTTA TCTGAATTTC CTAGTGGAAG TGCTTTGGCC CAATGGCCGG TTGTTGCGCG AATATACGCT GCTGCTCGAT CCGCCGCTGT ATTCGTCGCA GACGGCCATG TCCGCCGCGC AGAAAACCTT GCCCGAACAG CGCTCCGGTG CCGTCCCGCG TGCTCCGGCG GCTGCCGCGG CGTCCCGGCC GAACGCTTCG CGTCAGGAGG CCACCGGCGC GGCAGTCAGG AAATACAGGA CCGTCGCCAA CGACACCTTG TGGAAGGTCG CCGAGCGCGT GAGGACCTCG GGCACCATCC ATCAGACCAT GCTGGCGATC CAGGATCTGA ATCCGCAGGC CTTTCTCGAT GGCAACATCA ATCGTCTGGC GAGGGGGCAG GTTTTGCGCC TGCCCGACGA GACGCAGATC CGCCGCCGCT CGGCAAGCGA GGCGCTGGCC GAGGTCGCCG CGCAGAACGC GGCTTGGCGC GAGCATCGTG CCCGCCCGGC CGTCGCTTCC GCCCGGCAGT TGGATGCGAC CCATCGTACC GAGGCCGGCG CCGCGCCGGC GCGCGTGGAG ACCGGCGATA GCCTGCGCCT GGTCGCCGCC GACGGCGGCA AGGCGACTGC CGGTAGCGAC CAGGGAGCGG ATGGCAAGGC GGCTGCCGAC AAGCTGGCAG TGGCCAAGGA AAATCTCGAT GCGACCCAGC GCGAGAATGC CGAGCTGAAG AGCCGCATGA ATGACCTGCA GAGCCAGTTG GACAAGTTGC AGCGCCTGAT CGCTCTCAAG GACGAGCAAC TGGCCAGGCT GCAGGCGAAC CTGGCGCAAT CGGACGAGGC TGGTGGAGGC GCTTCCGCCG ATCTGGCCGC CGGCAATCCG GTGCAGGCGC AATCCGCTGT CGTCGACGCA CCGGCAGCCG GCCAAGCGGA GTCCGCGAGC GACGTTCCCG CGCGGATGGC GGTTGCATCC TCCGACGTAG AGGCATCTGC GCCCGCTGCA CCGCCAGCCG TCGTCCCTGA GCAGTCCGAA GCCGTCTCCG CCGTTCCTGT CCCACCCTCC GCTGTCGTCG CGCCTTCCGT GCCGCCAGTG GCCGAACCGG CCGCCGATAC CCCGCAGCAG CCCGCACCGA CCGAGCCGGC GGCTCCCCGG CCGCCGCAGT CGACTGTCGC GTCGGGGCCG ACCGCTCTGG TCGACTCCAT CCTGAGCAAT CCGCTGCTGC TTCCGTCACT GGGCGGCGGT GCCGCCGCCG CGCTGCTGTT GGGCTTGCTG GTAGCCCGTC GACGTGCGGC GAAGAAGACC GACTTCCAGG ATGAAGATGA CGAGGGTGGC ATGACTGATA CTCGGCTGGA TGCCGGCTTG TCGCGGCTGC CCGTCGAGCC CACCCTGTTG AACGGGTTCT CCGGGGCAGG GCCGAAGCGG GATGCAGTGA GTGAAGCCGA AGAGCATATT GCCTGCGGCC GCTTCAAGGA GGCTGCCGAG TCGCTGGAAG CTGCCTGCGC CGCCGAGCCA GGGCGCAGCG AGCTGCGCCT GAAGCTGATG GAGGTTCGTG CCGAGCTTGG CGATCGTGAG GGTTTCGCCC GTCAGGAGCG GGCGCTGCGC GAAACCGCTG GCGCCCAGTC ACAGGTCGAT CACCTGAAGA TGAAGTATCC GGCCATGGCC GGTTTCGCCA CCATCGCTCT GGCAGGTTCC GCCCTGGCAA GCGGACCGGA GTCGTCCGAA GGGGCGCAGG AGCGGCCTTC GCTGGCGGAA GAGTCCGAGC CGCCCCAACT TCTTGCAGCC GATATCGGCC TGAGGTTGGA TGATCTGGAG GCCGAGCTGG AAAGGGATCT TCAGCATTCC GCCCAGGACG ATGCTACATC GACTCCGGGC GACCTGACGC AGGACGAAGC GCCGCGGTTG GCGTCGCCTG TGGAGGAGCC TTTGATCGAG GATTTCTCCT TCGACCTGGA CCTTCCCGAT GAGGCCATCG ATTTCGAATT GGACGAAGAT CCGGAGGGTA TCTCCTTGGC TCCCCTACCG CTGACCGGAC CAGCAACCGG GGACAAGGCA TTCCCCGCGC TCGACCCGGG CGCCCCCGAG CTTCCCGTCG CTGCATTGGA CGAGACGCTT TCCCCGGAGG AGGCGTTTCT TCTCGAGGAG GGGCTGTTCG ACGGCTTCGA GCTGCCCGTA GACGACGATT TTTCCATCGC CCCCACTCCG GAGGCTTCGG GATTCGAGCT GCCATCAAGC GATGGTGCAT CGGAGGGCCA GGGGGTGTCG GGCCTCGTGG CCCAACTGGA TGAGCTGGAT GTCGAGCTCA AGCGACTGGC CGCCGGGCTG GGCGAGGGCG ATACGCAGCC GGCGGCTCGT TCGCAAGGTG ATCCGGTGGA GAGCGAGGAC TTCGATTTCC TGGCCGATGC GGACGAAACC GCCACCAAGC TGGATCTGGC TCGCGCCTAT ATCGACATGG GCGATACCGA AGGCGCCCGC GATATCCTTG AGGAAGTATT GAACGAAGGG AATGAAGTTC AGCGGCAGGA AGCGCGTGAA ATGTCCTCCC GCTTGACCTG A
|
Protein sequence | MVRVRKLALA IAAASVLSSG MAHALGLGEV TLRSALNQPL VAEIELLEAR DVGAEEIAPA LASPDAFDKA GVDRQHFLND LKFTPIIGPN GKRVIRVTST KPVREPYLNF LVEVLWPNGR LLREYTLLLD PPLYSSQTAM SAAQKTLPEQ RSGAVPRAPA AAAASRPNAS RQEATGAAVR KYRTVANDTL WKVAERVRTS GTIHQTMLAI QDLNPQAFLD GNINRLARGQ VLRLPDETQI RRRSASEALA EVAAQNAAWR EHRARPAVAS ARQLDATHRT EAGAAPARVE TGDSLRLVAA DGGKATAGSD QGADGKAAAD KLAVAKENLD ATQRENAELK SRMNDLQSQL DKLQRLIALK DEQLARLQAN LAQSDEAGGG ASADLAAGNP VQAQSAVVDA PAAGQAESAS DVPARMAVAS SDVEASAPAA PPAVVPEQSE AVSAVPVPPS AVVAPSVPPV AEPAADTPQQ PAPTEPAAPR PPQSTVASGP TALVDSILSN PLLLPSLGGG AAAALLLGLL VARRRAAKKT DFQDEDDEGG MTDTRLDAGL SRLPVEPTLL NGFSGAGPKR DAVSEAEEHI ACGRFKEAAE SLEAACAAEP GRSELRLKLM EVRAELGDRE GFARQERALR ETAGAQSQVD HLKMKYPAMA GFATIALAGS ALASGPESSE GAQERPSLAE ESEPPQLLAA DIGLRLDDLE AELERDLQHS AQDDATSTPG DLTQDEAPRL ASPVEEPLIE DFSFDLDLPD EAIDFELDED PEGISLAPLP LTGPATGDKA FPALDPGAPE LPVAALDETL SPEEAFLLEE GLFDGFELPV DDDFSIAPTP EASGFELPSS DGASEGQGVS GLVAQLDELD VELKRLAAGL GEGDTQPAAR SQGDPVESED FDFLADADET ATKLDLARAY IDMGDTEGAR DILEEVLNEG NEVQRQEARE MSSRLT
|
| |