Gene Avin_34230 details


Gene Information

Locus tag: Avin_34230
Symbol:
ID: 7762318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3494663
End bp: 3497503
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 66%
IMG OID: 643806285
Product: hypothetical protein
Protein accession: YP_002800547
Protein GI: 226945474
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3170] Tfp pilus assembly protein FimV
TIGRFAM ID: [TIGR03504] FimV C-terminal domain
            [TIGR03505] FimV N-terminal domain
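
The coordinate and length fields above are related by simple arithmetic, which makes a quick consistency check possible. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span_bp = end_bp - start_bp + 1
    if span_bp % 3 != 0:
        raise ValueError("CDS span is not a whole number of codons")
    # One codon is the stop codon, which does not encode an amino acid.
    return span_bp, span_bp // 3 - 1


# Values copied from the record above.
gene_bp, protein_aa = cds_lengths(3494663, 3497503)
print(gene_bp, protein_aa)  # 2841 946
```

The result agrees with the Gene Length (2841 bp) and Protein Length (946 aa) fields of this record.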


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGGG TTCGCAAACT GGCACTGGCA ATCGCGGCGG CCTCGGTGCT GTCTTCGGGC 
ATGGCCCATG CACTGGGCCT TGGAGAGGTG ACCCTGCGTT CCGCGCTGAA TCAGCCTCTG
GTTGCTGAAA TCGAACTGCT GGAGGCGCGC GATGTCGGCG CGGAAGAAAT CGCCCCGGCC
CTTGCTTCTC CCGATGCGTT CGATAAGGCG GGCGTCGATC GACAGCACTT TCTGAACGAC
CTGAAATTCA CGCCAATAAT CGGACCCAAC GGCAAGCGTG TCATTCGGGT GACCTCCACC
AAGCCGGTGC GCGAGCCTTA TCTGAATTTC CTAGTGGAAG TGCTTTGGCC CAATGGCCGG
TTGTTGCGCG AATATACGCT GCTGCTCGAT CCGCCGCTGT ATTCGTCGCA GACGGCCATG
TCCGCCGCGC AGAAAACCTT GCCCGAACAG CGCTCCGGTG CCGTCCCGCG TGCTCCGGCG
GCTGCCGCGG CGTCCCGGCC GAACGCTTCG CGTCAGGAGG CCACCGGCGC GGCAGTCAGG
AAATACAGGA CCGTCGCCAA CGACACCTTG TGGAAGGTCG CCGAGCGCGT GAGGACCTCG
GGCACCATCC ATCAGACCAT GCTGGCGATC CAGGATCTGA ATCCGCAGGC CTTTCTCGAT
GGCAACATCA ATCGTCTGGC GAGGGGGCAG GTTTTGCGCC TGCCCGACGA GACGCAGATC
CGCCGCCGCT CGGCAAGCGA GGCGCTGGCC GAGGTCGCCG CGCAGAACGC GGCTTGGCGC
GAGCATCGTG CCCGCCCGGC CGTCGCTTCC GCCCGGCAGT TGGATGCGAC CCATCGTACC
GAGGCCGGCG CCGCGCCGGC GCGCGTGGAG ACCGGCGATA GCCTGCGCCT GGTCGCCGCC
GACGGCGGCA AGGCGACTGC CGGTAGCGAC CAGGGAGCGG ATGGCAAGGC GGCTGCCGAC
AAGCTGGCAG TGGCCAAGGA AAATCTCGAT GCGACCCAGC GCGAGAATGC CGAGCTGAAG
AGCCGCATGA ATGACCTGCA GAGCCAGTTG GACAAGTTGC AGCGCCTGAT CGCTCTCAAG
GACGAGCAAC TGGCCAGGCT GCAGGCGAAC CTGGCGCAAT CGGACGAGGC TGGTGGAGGC
GCTTCCGCCG ATCTGGCCGC CGGCAATCCG GTGCAGGCGC AATCCGCTGT CGTCGACGCA
CCGGCAGCCG GCCAAGCGGA GTCCGCGAGC GACGTTCCCG CGCGGATGGC GGTTGCATCC
TCCGACGTAG AGGCATCTGC GCCCGCTGCA CCGCCAGCCG TCGTCCCTGA GCAGTCCGAA
GCCGTCTCCG CCGTTCCTGT CCCACCCTCC GCTGTCGTCG CGCCTTCCGT GCCGCCAGTG
GCCGAACCGG CCGCCGATAC CCCGCAGCAG CCCGCACCGA CCGAGCCGGC GGCTCCCCGG
CCGCCGCAGT CGACTGTCGC GTCGGGGCCG ACCGCTCTGG TCGACTCCAT CCTGAGCAAT
CCGCTGCTGC TTCCGTCACT GGGCGGCGGT GCCGCCGCCG CGCTGCTGTT GGGCTTGCTG
GTAGCCCGTC GACGTGCGGC GAAGAAGACC GACTTCCAGG ATGAAGATGA CGAGGGTGGC
ATGACTGATA CTCGGCTGGA TGCCGGCTTG TCGCGGCTGC CCGTCGAGCC CACCCTGTTG
AACGGGTTCT CCGGGGCAGG GCCGAAGCGG GATGCAGTGA GTGAAGCCGA AGAGCATATT
GCCTGCGGCC GCTTCAAGGA GGCTGCCGAG TCGCTGGAAG CTGCCTGCGC CGCCGAGCCA
GGGCGCAGCG AGCTGCGCCT GAAGCTGATG GAGGTTCGTG CCGAGCTTGG CGATCGTGAG
GGTTTCGCCC GTCAGGAGCG GGCGCTGCGC GAAACCGCTG GCGCCCAGTC ACAGGTCGAT
CACCTGAAGA TGAAGTATCC GGCCATGGCC GGTTTCGCCA CCATCGCTCT GGCAGGTTCC
GCCCTGGCAA GCGGACCGGA GTCGTCCGAA GGGGCGCAGG AGCGGCCTTC GCTGGCGGAA
GAGTCCGAGC CGCCCCAACT TCTTGCAGCC GATATCGGCC TGAGGTTGGA TGATCTGGAG
GCCGAGCTGG AAAGGGATCT TCAGCATTCC GCCCAGGACG ATGCTACATC GACTCCGGGC
GACCTGACGC AGGACGAAGC GCCGCGGTTG GCGTCGCCTG TGGAGGAGCC TTTGATCGAG
GATTTCTCCT TCGACCTGGA CCTTCCCGAT GAGGCCATCG ATTTCGAATT GGACGAAGAT
CCGGAGGGTA TCTCCTTGGC TCCCCTACCG CTGACCGGAC CAGCAACCGG GGACAAGGCA
TTCCCCGCGC TCGACCCGGG CGCCCCCGAG CTTCCCGTCG CTGCATTGGA CGAGACGCTT
TCCCCGGAGG AGGCGTTTCT TCTCGAGGAG GGGCTGTTCG ACGGCTTCGA GCTGCCCGTA
GACGACGATT TTTCCATCGC CCCCACTCCG GAGGCTTCGG GATTCGAGCT GCCATCAAGC
GATGGTGCAT CGGAGGGCCA GGGGGTGTCG GGCCTCGTGG CCCAACTGGA TGAGCTGGAT
GTCGAGCTCA AGCGACTGGC CGCCGGGCTG GGCGAGGGCG ATACGCAGCC GGCGGCTCGT
TCGCAAGGTG ATCCGGTGGA GAGCGAGGAC TTCGATTTCC TGGCCGATGC GGACGAAACC
GCCACCAAGC TGGATCTGGC TCGCGCCTAT ATCGACATGG GCGATACCGA AGGCGCCCGC
GATATCCTTG AGGAAGTATT GAACGAAGGG AATGAAGTTC AGCGGCAGGA AGCGCGTGAA
ATGTCCTCCC GCTTGACCTG A
 
Protein sequence
MVRVRKLALA IAAASVLSSG MAHALGLGEV TLRSALNQPL VAEIELLEAR DVGAEEIAPA 
LASPDAFDKA GVDRQHFLND LKFTPIIGPN GKRVIRVTST KPVREPYLNF LVEVLWPNGR
LLREYTLLLD PPLYSSQTAM SAAQKTLPEQ RSGAVPRAPA AAAASRPNAS RQEATGAAVR
KYRTVANDTL WKVAERVRTS GTIHQTMLAI QDLNPQAFLD GNINRLARGQ VLRLPDETQI
RRRSASEALA EVAAQNAAWR EHRARPAVAS ARQLDATHRT EAGAAPARVE TGDSLRLVAA
DGGKATAGSD QGADGKAAAD KLAVAKENLD ATQRENAELK SRMNDLQSQL DKLQRLIALK
DEQLARLQAN LAQSDEAGGG ASADLAAGNP VQAQSAVVDA PAAGQAESAS DVPARMAVAS
SDVEASAPAA PPAVVPEQSE AVSAVPVPPS AVVAPSVPPV AEPAADTPQQ PAPTEPAAPR
PPQSTVASGP TALVDSILSN PLLLPSLGGG AAAALLLGLL VARRRAAKKT DFQDEDDEGG
MTDTRLDAGL SRLPVEPTLL NGFSGAGPKR DAVSEAEEHI ACGRFKEAAE SLEAACAAEP
GRSELRLKLM EVRAELGDRE GFARQERALR ETAGAQSQVD HLKMKYPAMA GFATIALAGS
ALASGPESSE GAQERPSLAE ESEPPQLLAA DIGLRLDDLE AELERDLQHS AQDDATSTPG
DLTQDEAPRL ASPVEEPLIE DFSFDLDLPD EAIDFELDED PEGISLAPLP LTGPATGDKA
FPALDPGAPE LPVAALDETL SPEEAFLLEE GLFDGFELPV DDDFSIAPTP EASGFELPSS
DGASEGQGVS GLVAQLDELD VELKRLAAGL GEGDTQPAAR SQGDPVESED FDFLADADET
ATKLDLARAY IDMGDTEGAR DILEEVLNEG NEVQRQEARE MSSRLT
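
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch ignores). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence shown above.
# Each full display line holds 60 bases, so every line starts in frame.
print(translate("ATGGTTCGGG TTCGCAAACT GGCACTGGCA"))  # MVRVRKLALA
print(translate("ATGTCCTCCC GCTTGACCTG A"))           # MSSRLT
```

Both fragments match the start and end of the protein sequence listed in this record.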