Gene Avin_52330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_52330 
Symbol 
ID7764068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5346025 
End bp5348796 
Gene Length2772 bp 
Protein Length923 aa 
Translation table11 
GC content58% 
IMG OID643808046 
ProductType III restriction enzyme, res subunit 
Protein accessionYP_002802280 
Protein GI226947207 
COG category[S] Function unknown 
COG ID[COG3421] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.232434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCAC GTGTTAAGAA TCACGTCACT GGCCGCCTGT CGCTGCGGCC GCCGCAAGCC 
GAATCCCTGG CACGGCTGGT ACGCGCGCTG GAAAGCGCGC CGGAAATGCT GGGCAAGGAT
AGGGACGTTG CCTCGATTCT GGCGACCCTG AAAGCCGAAT TTCCGATGCT GGAGGACTTC
GAGCGGGACT TCCCTTCGCT GTGCTTCGCC TTGGCTACCG GAGTCGGCAA GACGCGCTTG
ATGGGGGCTT TTGTCGCCTA CCTGCATCTG GCACACGGCA TCAATAATTT CTTCGTGCTG
GCGCCGAATC TCACCATCTA TAACAAGTTG ATTGCCGATT TCACGCCGAA CACGCCCAAG
TACGTGTTCA AGGGCATCGG GGAGTTCGCC ATCAACGCGC CCAGGGTCAT CACCGGCGAC
AACTACGACC AGCAAAACGT CGCGGGTGGC GAGTTGTTCG GCGAGGTGCG GATCAACATC
TTCAATATCT CCAAGATCAA TTCCGAGGTG CGCGGCGGCA AGGAGCCACG CATCAAGCGC
ATGCGCGAGG TGCTGGGCGA AAGCTACTTC AACCACCTGG CCAACCTGCC GGACCTGGTG
TTGTTGATGG ACGAATCACA CCGCTATCGT GCCCAGGCCG GTATGCGGGC GATCAACGAA
TTGCACCCGC TGTTCGGATT GGAGGTGACG GCCACGCCCT TTGTGGAATC CGCCAAGGCC
CCCATACCTT TCAAGAATGT GGTGATGGAC TATCCCTTGG CGCGCGCCAT GGAAGATGGC
TTCGTCAAAG AACCGGCCGT CGTCACCCAA CGCAACTTCA AGGCCAGCAA CCACGCTTCG
GAAGAAATCG AGAAGATCAA GCTGGAGGAC GGCGTTCGCC TGCACGAAGC CACCAAGGTC
GAGCTATTGA CCTATGCCCG CGAGAACGGC GTCAAGGTTG TCAAACCCTT CATCCTGGTG
ATCGCGCGCG ATACGACGCA TGCCGGGCAG TTGAAGGCAT TGATCGAGTC CTCAGCATTC
TTTGAAGCGC GCTACCAGGG CAAGGTCATT CAGGTGGACT CCAGCCGCAC CGGAGCCGAG
GAAGAGAAGA TGATCGAGGC GTTGTTGAAT GTGGAGAATC CGGAAGAACC CACCGAGATC
GTTATCCACG TCAACATGCT CAAGGAAGGC TGGGACGTCA CCAACCTCTA CACCATCGTG
CCGCTACGTG CCGCCAATGC ACGCACGTTG ATCGAGCAAT CCATCGGGCG TGGCCTGCGC
CTGCCTTATG GCAAGCGTAC GGGCGTGGCT GCGGTGGATC GCCTCAACAT CGTGGCGCAC
GACAAGTTCC AGGAGATCAT CGACGAGGCC AACCGTGGCG ATTCGCCGAT CCGTCTCAAG
CAAGTGATCC TGGAAGCGCC CAGTAGCGAG GACAAAAAGG TCAGCGTTCA AGTGCTGCCC
AATCTGCTGA CCCAACTTGG TCTGCACGAC GAGCATGCGC CACATGTCCC GCCTGCGTTG
GCAACGGTGG ATGCCGGTAC TGAGGTTGGT GGCGAACAAG TCAACGCACA AACTCAGCCA
GTCTTTTCGA CCGAAGCAGA ATTCAAGGCG GCCCATGTCG TACGGGAAGT CCTTGCCACC
TATGAGGTGA AACGTGACTT GGCACCGACC AGTGCGGCAT TGCTGAAGCC GGAAATCCAG
CAAGAGATTC TGGCAGAAGT GGAAAAGCGC CTGAACCCCC AACAGGGGCA ATTGCTGCAA
GGCGCCGACG ACCAGGTGCC AGCGCTGGAT CTGTCCGCCG TCGTGGCCAA GACCACTGAA
ATTCTGGTTC AGCAGACCAT CGACATTCCG CGTATCGCCG TTGTGCCTAC CGGCGAAGTC
ACCACAGGCT TTCATCCGTT CCGCCTGGGG GCCTTGCCCA ACTTCCAGCC AGGGCAGCGC
GAGATCGTCG GCCAGACATT GCGTACCAGC GAACAATTCA CCCTGAACCG TGAAAGTGGC
CTGCGGGAAA ACCGCTTTGA AGATTACATC GTCAAGAAGC TGATCGACTT CGACGACATC
GACTACTTCA CGCAGGCCGA CTTGCTCTAC GACCTCGCCG GACAGGCTGC CGAGCATTAC
CAGCAGCAGA ACTATGCGGA CAGTGAACTG CACGAGATTT TCGATACCTA CGGCAAGGAG
CTCGCCCGCC TGATTCGTGC CCAGATGATG GAGCACTTCT GGGAAAAAGC CGCGGGCTAT
GAGGTTCAGG TCAGCAGAGG CTTCACCGAG CTGAAACCAT GTAACTACAC CGCTACGGAG
GGGCAGACGG CCCATAACCT GAGGGAAACC GTTACGGAAA CCAGCCGTAT CAAACAGATG
CTGTTCGGTG GGTTCACCCG TTGCCTGTAC CCCTTGCAGA AGTTCGACTC GGACACCGAA
CGGCGTTTTG CCTTGCTGCT GGAACGTGAC GCCCTCAAAT GGTTCAAGCC TGCCAAGGGC
CAGTTCCAGA TCTACTACAA GCTGGGCAGC GAACAGCCGG AATACATCCC CGACTTCGTG
GCCGAGTTGG ATGGAATGAT CCTGATGGTC GAAACCAAAG CGCGTGTCGA TCTGGCTTCC
GCTGAAGTCC AGGCCAAGAG CGCTGCCGCC TCTCGATGGT GTCGGCACGC CAGTGAACAT
GCGGCTGAAG TCGGCGGTAA GTCCTGGCGC TACCTCGTGG TTCCCCATGA TGAAGTCACT
GAAGACAAAC GCCTGTCCGA CTACCTGCGG TTTGAAGTCA AGGACACTGC GGAAGGAGGC
AGCCCAGCCT GA
 
Protein sequence
MSSRVKNHVT GRLSLRPPQA ESLARLVRAL ESAPEMLGKD RDVASILATL KAEFPMLEDF 
ERDFPSLCFA LATGVGKTRL MGAFVAYLHL AHGINNFFVL APNLTIYNKL IADFTPNTPK
YVFKGIGEFA INAPRVITGD NYDQQNVAGG ELFGEVRINI FNISKINSEV RGGKEPRIKR
MREVLGESYF NHLANLPDLV LLMDESHRYR AQAGMRAINE LHPLFGLEVT ATPFVESAKA
PIPFKNVVMD YPLARAMEDG FVKEPAVVTQ RNFKASNHAS EEIEKIKLED GVRLHEATKV
ELLTYARENG VKVVKPFILV IARDTTHAGQ LKALIESSAF FEARYQGKVI QVDSSRTGAE
EEKMIEALLN VENPEEPTEI VIHVNMLKEG WDVTNLYTIV PLRAANARTL IEQSIGRGLR
LPYGKRTGVA AVDRLNIVAH DKFQEIIDEA NRGDSPIRLK QVILEAPSSE DKKVSVQVLP
NLLTQLGLHD EHAPHVPPAL ATVDAGTEVG GEQVNAQTQP VFSTEAEFKA AHVVREVLAT
YEVKRDLAPT SAALLKPEIQ QEILAEVEKR LNPQQGQLLQ GADDQVPALD LSAVVAKTTE
ILVQQTIDIP RIAVVPTGEV TTGFHPFRLG ALPNFQPGQR EIVGQTLRTS EQFTLNRESG
LRENRFEDYI VKKLIDFDDI DYFTQADLLY DLAGQAAEHY QQQNYADSEL HEIFDTYGKE
LARLIRAQMM EHFWEKAAGY EVQVSRGFTE LKPCNYTATE GQTAHNLRET VTETSRIKQM
LFGGFTRCLY PLQKFDSDTE RRFALLLERD ALKWFKPAKG QFQIYYKLGS EQPEYIPDFV
AELDGMILMV ETKARVDLAS AEVQAKSAAA SRWCRHASEH AAEVGGKSWR YLVVPHDEVT
EDKRLSDYLR FEVKDTAEGG SPA