Gene Avin_04540 details


Gene Information

Locus tag: Avin_04540
Symbol:
ID: 7759412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 429787
End bp: 432033
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 68%
IMG OID: 643803376
Product: hypothetical protein
Protein accession: YP_002797684
Protein GI: 226942611
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
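
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(429787, 432033)
print(length_bp)                  # 2247
print(protein_length(length_bp))  # 748
```

Both values match the Gene Length and Protein Length fields of the record.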


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACGC TCGGCAAAAT CCTTGGCCTG CTCGTCCTCG GGCTGCTGCT GGTCGTAGTC 
GCCCTCGGCT TCGCCCTTAC TCAGTTGTTC GATCCCAACG ACTACAAGGA CGAAATCCGC
CAGCTCGCCC GCGACAAGGC CAATCTGGAA CTGACCCTGA ACGGCGACAT CGGCTGGAGT
CTCTTCCCCT GGCTGGGCCT GCAGTTGCAG CAGACCAGCG TGGCCAGCGC CCGCACCCCC
GACCAGCCGT TCGCCGACCT GGACATGATC GGCCTGTCGG TACGAGTGCT GCCGCTGCTG
CGCCGGGAGA TCCAGATGAG CGACATCCGC GTCGACGGCC TCGAACTGCA CCTGCGCCGC
GACAAGCAGG GCCGCGGCAA CTGGGAGGAC ATCGGCCGTC CGGCACGGAC GGCCGGGCAG
CCGGAGCCGG AGGCCACCGC CGGGACAGAC ACGGCACCGG CCGAAGCGCC CGCCGCGGCC
GATGCGGCCG ATGCGGGCAG CGCCCGGGCG CTGCGCCTGG ACATCGACAG CCTGGCGCTG
AACAACGCCC GGGTGGACTA CCTGGACGAG CGCAGCGGCC AGAAATTCAG TGCCGAGAGC
ATCCAGCTCA ACAGCGGCGC GATACGCCCG GACAGCGACA TCCCCCTCAA GCTGACCGCC
TTCTTCGGCA GCAACCAGCC GGTCATGCGC GCGCGCACCG ATCTCGAGGG CGTGCTGCGC
TTCGACTCCG CGCTCCAGCG CTTCCAGTTG GGCAACGCCC GGCTGTCCGG CGAAGCCTCG
GGCGAACCGC TCAAGGGCAA GAGCCTGAAC TTCGCCGCCC AGGGCGAGTT GCTCGCCGAC
CTGGCCGCCC AGGTGGCCGA ATGGAACGGC CTGAAGCTCA GCGCCAACCA GTTGCGCGCC
ATCGGTGAAC TGAAGGTGCG CGAGCTGGAC AAGATCCCGC AACTGAGCGG CGGCCTGTCC
ATCGCCCAGT TCAACCCGCG CGAATTCCTC GCCGGCCTCG GCCAGGAGCT GCCGGCGACG
GCCGATCCCG CCAGCCTGAC CCGCCTCGAA CTGACCACCC AGTTGGGTGG CTCGCCGAGC
GCCGTGGCCT TCGACAAGCT CGACCTGAAG CTGGACGACA GCCAGTTCAC CGGCCGCATC
GCCGTCGCCG ACCTCGCCCG CCAGGCCCTG CGCGTACAGC TCAAGGGCGA CAAGCTGGAC
CTCGACCGCT ACCTGGCGCC GAAGAGCGAG AAGCGAGAGG CCGCCGGCGC CGCGCGCCAG
GCCGAAGTCA AGGGCGCCGT GGCCAGCGCC ATCCAGAACA GCGACACCCC GCTGCCCGAT
GCCCCCACCC AGCAGGCTTG GAGCGAAGAC CCCGTGCTGC CGGTCGACAC CCTGCGCAAA
CTGGACCTGC AGGCCAGCCT CGACATCGGC CAGTTGACCA TCGAGCGCCT GCCGGTCGGC
AACGCCCACC TGCAGGCCAG CGCCAAGGAC GGCCTGCTGA CCCTGCAGAG CCTGCGCGGC
GAACTCTTCG GCGGCGGCTT CGAGACGCAG GCCCGCCTTG ACGCCCGTCC GTCCACGCCC
CTGCTGAGCC TGCAGCAGCG CCTCAGCCGG ATTCCGGTGG AGAAATTCAT CCGGACGGAA
GGCAAGGAAA CGCCTCCCAT CAAGGGCCAG CTCGATCTGA ACGCCGACCT GGAAACCCGC
GGTAACAGCC AGAAGGCCTG GATCGAAGGA CTCAACGGCA CGGCCAGCTT CGCCCTCCAC
AAAGGCGTGC TGCCCGAAGC CAACCTCGAA CAGCAGCTCT GCCTGGCCAT CGCCACCCTC
AACCGCAAGG GCCTGAACAA TCCGCCCAAG GCCAAGGACA CGCCCTTCGA GGAACTCAAG
GGCAACCTGC GCTTCACCAA CGGCGTCGCC AGCAACCCCG ACCTCAAGGC GCGCATTCCC
GGCCTGACGG TCAACGGCAA CGGCGACCTG GACCTGCGCG TGCTCGGCAT GAACTACCGC
ATCGGCGTGA TCATTGAAGG CGACAGGCGC GCCATGCCCG ATGCCGCCTG CCAGGTCAAC
GAGCGCTATG TCGGCCTGGA ATGGCCGGTC CGCTGCCGCG GCCCGCTGGA ACTGGGCGCC
AAGGCCTGCC GCCTGGACCG CGACGGCCTG CGCCAGATCG CCGCCAAGCT GGCCGGCGAG
AAGCTCAACG AGAAGCTCGA AGAAAAACTC GGCGACAAGG TCAGCCCGGA ACTCAAGGAT
GCGCTGAAGG GGCTGTTCAA CCGATGA
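
The record reports a GC content of 68% for this gene. A minimal sketch of how such a figure could be computed from a sequence block laid out in 10-base groups, as above, follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first 30 bases of the gene, so its result is not the whole-gene value.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
first_bases = clean_sequence("ATGAAAACGC TCGGCAAAAT CCTTGGCCTG")
print(len(first_bases))          # 30
print(gc_fraction(first_bases))  # 0.5
```

Passing the full 2247 bp block through the same two functions would give the whole-gene fraction to compare against the reported 68%.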
 
Protein sequence
MKTLGKILGL LVLGLLLVVV ALGFALTQLF DPNDYKDEIR QLARDKANLE LTLNGDIGWS 
LFPWLGLQLQ QTSVASARTP DQPFADLDMI GLSVRVLPLL RREIQMSDIR VDGLELHLRR
DKQGRGNWED IGRPARTAGQ PEPEATAGTD TAPAEAPAAA DAADAGSARA LRLDIDSLAL
NNARVDYLDE RSGQKFSAES IQLNSGAIRP DSDIPLKLTA FFGSNQPVMR ARTDLEGVLR
FDSALQRFQL GNARLSGEAS GEPLKGKSLN FAAQGELLAD LAAQVAEWNG LKLSANQLRA
IGELKVRELD KIPQLSGGLS IAQFNPREFL AGLGQELPAT ADPASLTRLE LTTQLGGSPS
AVAFDKLDLK LDDSQFTGRI AVADLARQAL RVQLKGDKLD LDRYLAPKSE KREAAGAARQ
AEVKGAVASA IQNSDTPLPD APTQQAWSED PVLPVDTLRK LDLQASLDIG QLTIERLPVG
NAHLQASAKD GLLTLQSLRG ELFGGGFETQ ARLDARPSTP LLSLQQRLSR IPVEKFIRTE
GKETPPIKGQ LDLNADLETR GNSQKAWIEG LNGTASFALH KGVLPEANLE QQLCLAIATL
NRKGLNNPPK AKDTPFEELK GNLRFTNGVA SNPDLKARIP GLTVNGNGDL DLRVLGMNYR
IGVIIEGDRR AMPDAACQVN ERYVGLEWPV RCRGPLELGA KACRLDRDGL RQIAAKLAGE
KLNEKLEEKL GDKVSPELKD ALKGLFNR
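
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(block: str) -> str:
    """Translate a grouped DNA block in frame 1, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence, copied from the record.
print(translate("ATGAAAACGC TCGGCAAAAT CCTTGGCCTG"))  # MKTLGKILGL
print(translate("GCGCTGAAGG GGCTGTTCAA CCGATGA"))     # ALKGLFNR
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.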