Gene Avin_36280 details

Gene Information

Locus tag: Avin_36280
Symbol:
ID: 7762523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3691703
End bp: 3694624
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 66%
IMG OID: 643806495
Product: lipoprotein
Protein accession: YP_002800750
Protein GI: 226945677
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3451] Type IV secretory pathway, VirB4 components
TIGRFAM ID:
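
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length agrees with that reading) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3691703
end_bp = 3694624

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values match the Gene Length (2922 bp) and Protein Length (973 aa) fields.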


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCTCT TCGAACGATT CCGCGGCCGC GCCCGGATGG CGGAGAAAGT CGAGCCCGAG 
GTCCAGATCG CCGACGTCGC TCAAGAGAGT CATGAGGCTC ATGAGGCTCA TGAGGCTCCC
GACATCGAGA GGATCGAAGC CGCCACCGAG CGCCATCTCG AACGGCTGAC CCGCCTCGGC
ATCCCCGATC CGACGGACTG GCGAGATCCC GGCAAACGCC CGGCGACCCA GGCGGACGTC
GCCAGGCTCT ACGAGGTGGC GCCCTCCTTC GTCGACCTGT TGCCCTGGGT GGAATACCTG
CCGGACGAGC AGGCGATGCT CCTCGAGGAC GGCCATTCGT GGGCAGCCTT CTTCGAACTG
ACGCCGATCG GCACCGAAGG GCGCGACCCG GCCTGGCTGC GGATGGTTCG GGACGCCCTG
GAAAACGCCC TGCAGGACAG TTTCGATGAA CTCGACAGCT CGCCGTGGGT GGTACAGATG
TATGCCCAGG ACGAAACCAA CTTCGACGAT TACCTGCAGT CCCTGCGCAA CTACATCCAG
CCCAGGGCCG CGGGGAGCGA CTTCACCGAG GTCTACCTGG CGTTGTTCAA GCGGCATCTG
GAGGCGATCG CCAAGCCCGG CGGATTGTTC GAAGACACGG CCGTCAGCCA GTTGCCCTGG
CGCGGCCAGC AGCGCCGGGT CCGCCTGGTG GTCTACCGCC GCGTCCAGGT ATCGGACATG
GTCGTGCGTG GCCAGGCGCC CGCCCCCTAT CTCGCGGTGA TCTGCGATCG CCTGGTCGGT
GCCCTGGCCA ACGCCGGCGT GGTCGCCAGA CGCATGGATG GCCAGGCCGT GCGCACCTGG
CTCGTCCACT GGTTCAATCC CCGCCCCGAC CACCTGGGCG CGACGGATGC CGACATCCGC
CGCTTCCTCG AACTGGTCTG CCAAGCCCCG GAGGGACTCT CCGAAGACGA CTTGCCGCTG
GCCAGCGGCA CCGACTTCGC CCAGAACCTC TTCTACCGGG AACCGCAGTC CAGCACGCCC
AAAGGGCTCT GGTATTTCGA TGCGATGCCG CATCGCGTGG TGGTCGTCGA CCGGCTGCGC
GATGCGCCGA AGACCGGCCA CCTGACCGGC GAGACCCGCA AGAGCGATGC CTACAACGCC
CTGTTCGACC GCCTGCCGGA AGACACCGTG CTCTGCCTCA CGATGGTCGC CACGCCCCAG
GACCTGCTGG AAGGCCACCT GGAACAGCTC TCGAAGAAAG CGGTGGGCGA TACGCAGGCC
TCGATCCACA CGCGCGACGA TGTGGAGCAC GTCCGCACAC TGCTCGGCCG CAAGCACAAG
CTCTATCGGG GGAACCTCGC CTTCTATCTG CGGGGGACAG ACCACAGGCA GCTCGACGAA
CGGACCACCC AGCTCAGCAA CGCCCTGCTC GGTGCCGGCA TGTCGCCGGT CCAGCCGCAG
GACGAGGTCG CCCCGCTGAA CAGTTACCTG CGCTGGCTGC CCTGCAACTT CGACCCCAAC
GAGCGGCATG CCCTGGACTG GTACGTCCAG TTCATGTTCG TCCAGCACAT CGCGAATCTG
GCACCGATCT GGGGACGCTC GACGGGCACC GGCCACCCCG GTGTCACCCT GTTCAACCGC
GGTGGCGCGC CGGTCACCTT CGACCCGCTG AACCGCCTCG ATCGCCAGAT GAATGCCCAC
CTGTTCATCT TCGGTCCGAC CGGCTCGGGC AAATCGGCTT CGGCGACCAA CATCCTCAGC
CAGGTCATCG CGATCTACCG GCCTCGCCTG TTCATCGTCG AGGCGGGCAA CAGCTTCGGC
CTGCTGGGCG AGTTCGCGAA AAACCTGGGG CTCAGCGTGA ACCGCGCGCG TCTCGCTCCC
GGTTCGGGCG TCAACCTGGC GCCCTTCGCC GATGCGATCA AGCTCATCGA AGCCCCCGGC
AAGACGAAGG TCCTGGACAT GGAGGCCTCC GACGATCACC TGGCCGACAC CGAGGACGAG
CAGCGGGACA TCCTCGGCGA GATGGAAATC ACCGCCCGCC TGATGGTCAC CGGCGGGGAG
GCGAAAGAGG ACGCGCGACT GACGCGCGCC GACCGCAGCG CCCTGCGCCA GTGCATCCTG
GACGCGGCCA AGACCTGCTC CGAGCAGGGA AAGACCGTCC TCCCGGAGGA TGTGCGCGAT
GCTCTGCGCC GCATGGCGGC CGACGAGAAA ATACTTGAAC CCCGGCGCAA TCGCCTGATG
GAAATGGCCG AGGCCATGTC CATGTTCTGC ATGGGCGCCG AAGGCGAAAT GTTCAACCGG
CCGGGCAGCC CCTGGCCCGA GGCGGACCTC ACCATCGTCG ACCTGGCGAC CTACGCCCGC
GAAGGCTACG AAGCGCAGAT GGCCATCGCC TACATCTCGC TGCTCAACAC GGTGAACAAC
ATCGCCGAGC GCGACCAGTT CAAGGGGCGG CCGCTGATCT TCTTCACCGA CGAAGGCCAC
ATCCAGCTCA AGGTGCCGCT GCTCTCTCCC TACGCGGTCA AGATCACCAA GATGTGGCGC
AAGCTCGGCG CCTGGTTCTG GATGGCCACC CAGAACGTGG ACGACGTCCC CCCGGAGGCC
TCCGCCCTGC TGAACATGAT CGAGTGGTGG ATCTGCCTGA ACATGCCGCC GGACGAGGTG
GAGAAGATCG CCAGGTTCCG CGAACTGACG CCGGCGCAGA AGGCGATGAT GCTCTCGGCC
CGCAAGGAGA ACGGCAAGTT CACCGAAGGC GTCGTGCTCG CCAAGCGCCT CGAACTGCTG
TTCCGGGTGG TTCCGCCCAG CCTGTACCTG GCGCTCGCGA TGACCGAGCC GGAAGAAAAA
AAGCAGCGCT ACGACATCAT GATGGCCAAG GGCTGCGACG AACTCGGCGC CGCCCTGGAA
GTCGCCGCCG ACCTGGACCG CAAACGCGGC ATCACCTCCT GA
 
Protein sequence
MGLFERFRGR ARMAEKVEPE VQIADVAQES HEAHEAHEAP DIERIEAATE RHLERLTRLG 
IPDPTDWRDP GKRPATQADV ARLYEVAPSF VDLLPWVEYL PDEQAMLLED GHSWAAFFEL
TPIGTEGRDP AWLRMVRDAL ENALQDSFDE LDSSPWVVQM YAQDETNFDD YLQSLRNYIQ
PRAAGSDFTE VYLALFKRHL EAIAKPGGLF EDTAVSQLPW RGQQRRVRLV VYRRVQVSDM
VVRGQAPAPY LAVICDRLVG ALANAGVVAR RMDGQAVRTW LVHWFNPRPD HLGATDADIR
RFLELVCQAP EGLSEDDLPL ASGTDFAQNL FYREPQSSTP KGLWYFDAMP HRVVVVDRLR
DAPKTGHLTG ETRKSDAYNA LFDRLPEDTV LCLTMVATPQ DLLEGHLEQL SKKAVGDTQA
SIHTRDDVEH VRTLLGRKHK LYRGNLAFYL RGTDHRQLDE RTTQLSNALL GAGMSPVQPQ
DEVAPLNSYL RWLPCNFDPN ERHALDWYVQ FMFVQHIANL APIWGRSTGT GHPGVTLFNR
GGAPVTFDPL NRLDRQMNAH LFIFGPTGSG KSASATNILS QVIAIYRPRL FIVEAGNSFG
LLGEFAKNLG LSVNRARLAP GSGVNLAPFA DAIKLIEAPG KTKVLDMEAS DDHLADTEDE
QRDILGEMEI TARLMVTGGE AKEDARLTRA DRSALRQCIL DAAKTCSEQG KTVLPEDVRD
ALRRMAADEK ILEPRRNRLM EMAEAMSMFC MGAEGEMFNR PGSPWPEADL TIVDLATYAR
EGYEAQMAIA YISLLNTVNN IAERDQFKGR PLIFFTDEGH IQLKVPLLSP YAVKITKMWR
KLGAWFWMAT QNVDDVPPEA SALLNMIEWW ICLNMPPDEV EKIARFRELT PAQKAMMLSA
RKENGKFTEG VVLAKRLELL FRVVPPSLYL ALAMTEPEEK KQRYDIMMAK GCDELGAALE
VAADLDRKRG ITS
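
The sequences above are displayed in blocks of ten characters, sixty per line, which has to be undone before the data can be used. The following is a minimal sketch of that cleanup, using only the first and last lines of the gene sequence as input; `join_blocks` is a hypothetical helper name, not part of any library.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that are used only for display."""
    return "".join(text.split())


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGGCCTCT TCGAACGATT CCGCGGCCGC GCCCGGATGG CGGAGAAAGT CGAGCCCGAG"
last_line = "GTCGCCGCCG ACCTGGACCG CAAACGCGGC ATCACCTCCT GA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

# A complete CDS is expected to open with a start codon and
# close with a stop codon.
start_codon = head[:3]
stop_codon = tail[-3:]

print(len(head), start_codon)
print(len(tail), stop_codon)
```

Applying the same helper to all lines of the gene sequence should give a single string whose length can be compared with the Gene Length field.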