Gene Avin_28020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_28020 
Symbol 
ID7761707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2889430 
End bp2891634 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content60% 
IMG OID643805681 
Producthypothetical protein 
Protein accessionYP_002799949 
Protein GI226944876 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03549] conserved hypothetical protein TIGR03549 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.458293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATTA AGGTTAATTT TCTCGATAAC CTTCGGCTAG AAGCCAGGTT CGATGATTTC 
ACGGTGATCG CCGACCAGCC GATCCGCTAC AAAGGCGATG GCTCCGCACC GGGTCCGTTC
GATTACTTCC TGGCTTCATC GGCTTTGTGC GCGGCTTACT TCGTAAAGTT GTACTGCCAG
ACGCGCGATA TCCCCACCGA TAATATCCGC CTGTCGCAGA ACAACATTGT CGATCCGGAG
AATCGCTACA AGCAGATCCT CAAGATCCAG GTGGAGTTGC CGGCGGATAT CTCGGAGAAG
GATCGCCTGG GCATCCTGCG TTCCATCGAC CGCTGCACGG TGAAGAAGGT CGTGCAGACC
GGGCCTGACT TCGTGATCGA GGAAGTGGAA AACCTCGATG CCGATGCCCA GGCGTTGTTA
ATGCTCAACC CCGACGCCGA CGCGAGCACC TACATCCTGG GCAAGGATCT GCCGCTGGAG
CAGACCATCG CCAACATGTC GGAAATCCTC GCCGGCCTGG GCATGAAGAT CGAGATCGCG
TCGTGGCGCA ACATAGTGCC CAACGTGTGG TCGCTGCATA TCCGCGATGC GCAATCGCCG
ACGTGCTTCA CCAACGGCAA GGGTGCGACC AAGGAAAGCG CGCTGGCATC GGCTCTGGGC
GAATTCATCG AGCGCCTGAA CTGCAATTTC TTCTACAACG ACCAGTTCTG GGGAGAAGAC
ATCGCCAATG CGGCGTTCGT GCACTACCCG GATGAGCGCT GGTTCAAGCC TGGCCGCAAG
GATGCGCTGC CGGCTGAAAT CCTCGACGCG TACTGCCTGG AAATCTACAA CCCCGACGAC
GAGTTGCGTG GTTCGCACCT GTACGACACC AACTCCGGCA ATGTGGAGCG CGGCATCTGT
TCGCTGCCGT TCGTGCGCCA GTCCGACGGC GAGGTGGTGT ACTTCCCGTC CAACCTGATC
GAGAACCTGT ACCTCAGCAA TGGCATGAGT GCCGGCAATA TGCTGGCCGA AGCGCAGGTG
CAGTGCCTGT CGGAAATCTT CGAGCGCGCG GTGAAGCGTG AAATCCTCGA AGGTGAACTC
GCCCTGCCCG ATGTGCCGCA CGAGGTGCTG GCGAAGTACC CGGGCATCCT GGCCGGTATC
CAGGGCCTGG AAGAACAGGG CTTCCCGGTG CTGGTCAAGG ATGCGTCGCT GGGCGGCGAG
TTCCCGGTGA TGTGCGTGAC CTTGATGAAC CCGCGTACCG GCGGCGTGTT CGCCTCGTTC
GGCGCGCACC CGAACTTCGA GGTGGCACTG GAGCGCAGCC TGACGGAGTT GCTGCAGGGC
CGCAGCTTCG AAGGCCTGAA CGACCTGCCG GCGCCGACCT TCGAAAGCCA CGCCTTGACC
GAGCCGAACA ACTTCGTCGA ACACTTCATC GACTCCAGCG GCGTGGTGTC GTGGCGCTTC
TTCAGCGCCA AGGCCGACTT CGAATTCGTC GAGTGGGACT TCTCCGGCCA GGGGGAAGAT
TCCAATGCCG AGGAAGCCGC CACCCTGTTC GGCATTCTCG AAGACATGGG CAAGGAAGTG
TACATGGCGG TGTACGAGCA CCTGGGGGCC ACGGCGTGCC GCATCCTGGT GCCAGGGTAT
TCGGAGATCT ATCCGGTAGA GGATCTGATC TGGGATAACA CCAACAAGGC GTTGTCGTTC
CGCGAGGACA TCCTGAACCT GCACCGCCTG GACGATGCCC GCCTCAAGGC ACTGCTCAAG
CGTCTGGAAA ACTGCGAGGT GGATGACTAC ACCGACATCA CCACCCTGAT CGGTATCGAG
TTCGACGACA ACACGGTCTG GGGCCAGTTG ACCCTCCTCG AACTGAAACT GCTGATCAGC
CTCGCCCTGC GTCGCTTCGA AGACGCGAAG GAACTGGTGG AAGCCTTCCT GCAGTACAAC
GACAACACGG TCGAGCGGGG GCTGTTCTAC CAGACCCTGA ACGTAGTGCT GGAAGTGGTG
CTGGACGAAG AGCTGGAGCT GGCCGACTAC GAGGTCAACT TCCGCCGGAT GTTCGGCAAC
GAGCGGATGG ACGCGGCGCT GGGGTCGGTG GATGGCAGCG TGCGCTTCTA CGGTCTGACG
CCGACCAGCA TGAAGCTGGA AGGGCTCGAC AGGCACCTGC GCCTGATCGA CAGCTACAAG
AAGCTGCACG GGGCGCGGGC CAGAGTGGCG GCTTTATCCC AATAA
 
Protein sequence
MEIKVNFLDN LRLEARFDDF TVIADQPIRY KGDGSAPGPF DYFLASSALC AAYFVKLYCQ 
TRDIPTDNIR LSQNNIVDPE NRYKQILKIQ VELPADISEK DRLGILRSID RCTVKKVVQT
GPDFVIEEVE NLDADAQALL MLNPDADAST YILGKDLPLE QTIANMSEIL AGLGMKIEIA
SWRNIVPNVW SLHIRDAQSP TCFTNGKGAT KESALASALG EFIERLNCNF FYNDQFWGED
IANAAFVHYP DERWFKPGRK DALPAEILDA YCLEIYNPDD ELRGSHLYDT NSGNVERGIC
SLPFVRQSDG EVVYFPSNLI ENLYLSNGMS AGNMLAEAQV QCLSEIFERA VKREILEGEL
ALPDVPHEVL AKYPGILAGI QGLEEQGFPV LVKDASLGGE FPVMCVTLMN PRTGGVFASF
GAHPNFEVAL ERSLTELLQG RSFEGLNDLP APTFESHALT EPNNFVEHFI DSSGVVSWRF
FSAKADFEFV EWDFSGQGED SNAEEAATLF GILEDMGKEV YMAVYEHLGA TACRILVPGY
SEIYPVEDLI WDNTNKALSF REDILNLHRL DDARLKALLK RLENCEVDDY TDITTLIGIE
FDDNTVWGQL TLLELKLLIS LALRRFEDAK ELVEAFLQYN DNTVERGLFY QTLNVVLEVV
LDEELELADY EVNFRRMFGN ERMDAALGSV DGSVRFYGLT PTSMKLEGLD RHLRLIDSYK
KLHGARARVA ALSQ