Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_26580 |
Symbol | |
ID | 7761566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2719249 |
End bp | 2721867 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805536 |
Product | Pentapeptide repeat protein |
Protein accession | YP_002799809 |
Protein GI | 226944736 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCA TCAAACCCCT GAGGCTCGGC GTGCTGCACC GCACCTATCA CTGGCGGCAC GGCCATCGCC TCGCCGTGAC GGCGATCGCC CTGGCGACCC TGGAGGAGTC TCCGGTGCTG CTACCCGAGC AGGAACTCTG GGCGTTGCTC GACGAGGCGC TGGACGAGAA CGAGCAGATC GACCTGCTGA TGCCCAAGCC ATGTCCGGAA TTCCTGGTCA ACGGTCACGC CTACAACGTC CACGGGACGG ACCGCCGGCG CTGCCGGGTG CAGGCGCGGC TGGACGACCG CTGCAAGGCC CTGGTGGTCC ATGGCGACCG GTACTGGGCG GAAGGCGAGG CCGGCGAGCC GGCTGACTTC GAGGCCATGC CGCTCGGCCG GCGACGGGCC TATGGCGGCC CCGGTTACGA GGCCAATCCC ATCGGCATCG GCCATGTCCC GGAGTCGGTC GACGGGGGCG AGCGCTGGCC CTTGCCGAAC GTCGAGCACG ACGGACAGCC GCCGTTCGCC CCCGGCCAGC CGCCCGGGAC GCCGGCGGGT TTCGGCATGC GCGACATCGA TGCGCCGGCG CACCGGGCGA AGCTGGGCCA GTACGACGAA GAGACGCTGG AGCGGGACGG TCCGGGGCTG GCGGAGTCGT TCGACTGGCG CTTCTTCAAT CTGGCGCCGG ACGACCAGCA ATGGCCGGAT CGCGACCGCC TGGCGGGCGG CCTGGAGTAC GAATTCCTCA ACCTGCACCC CGAACTGGCG CGCCTCGCCG GGCACCTGCC GGACGGATTG GCGCGCTGCT TCGTCATGCG CCAGCCGGCC GAAGGCGAAT CCGCCCTGGA AGAGATTCCG CTGCGTCTCA CCACGGCCTG GTTCTTTCCC GACCGGCGGC GCGTGGCGCT CATCCACCAT GGCGACCTGG CCGTCGACGA CGAGCAGGCT TCCGACATCC AGTACCTGAT GCCGGCCTTC GAGGCCGGCG CCGCGCCGCG TGGCCTGGAG CACTACGCCG AGGCGCTGGC GCGGCGGCTG GACGAGGAGG AGGGCGACCT GTACGCCTTC GACGAGGCGG CGCTGATCCA CGAACCCTTC ATCGGCGCCG GCTTCGATAC CGAGCCGCTG GACCAGGGGC CGTCCGATCC GCTGATGGAC AACCTGTTGC GCCGGCAGGA AGAGGCAATG CGCAACGAAC GCGAGCGGAT GAACTCGCTC GGCCTCGATC CGCAGCGCCT GGCGGGCCTC GCGCCCGCCG GGGACGACGA GGAACTGCGC CTGCAGCGGC TGGCCGACCT GCCGCGGGTC GGGCGCAATA TCCGGCGCAA GGAGGCGGAA CTCGAAGCCC AGGCGGAACG CGAGCGGGCC GCCGCCCTGG AGCGCCTGCG CGACCAGGAG CCGACGACGG CGAACCGGGA ACTGCTGGCG CAACTGGAAA ACCCGGACAG GGAACTTCCG CCTTTCGATT TCGCCGCCCG TTCCGCGCAG TTGCGGGAGG TCTACCGGAT GGACTTTTCC CCGGCGTCCG CCGACTTGCC GTCCCGGGAG GAAAGCGAGC GGCGCCTGCG CCAGCAGTAC CGGGACTCGG TGCATTGCTG CGGGGCCGCC CCGGCCCTGC AGGGCGCGGC GGCCGAGGCC CTGCGCCGGA AGGTGGCGCA AGCCTATGCG CGGGATCGCG ACCTGGCGGG TATGGATCTG ACCGGCGCCG ATCTGTCCGG CATGGACCTG TCCGGCGCGC GGCTGACCGG CGTCCTGCTG GAAAGCGCCA ACCTGGCGGA TGCGCGGCTG GACGGCGCCG ATCTGCGCGA GGCGATCCTC GCCAGGGCCC GCCTGGACGG CGCCTCGTTG CGCGGCGCCG ATTGCCGGGG CGCCAACCTG TCCCGCGCGC AGGCCCGCAA CGCCTGTTTC AGTGGCGCGA CCTTCGGCGA CGGGCAATGC TGGGAAGCAC GCTTCGAGAC CTGCGATTTC AGCCACGCCC GGTTCGGCGG GATACTGTGG CAGGGGTGCG AACTGGACGC CTGCCGTTTC GACGGGGCGG TGCTGGAAGA CTTGTCCCTG CATGCCTGCC GGCTGAGCCG GCCTTCCTTC GTCGGGGCGA GCCTGAGTGC CGTCACCTGG GTCGAGTCGA GCCTGGAGGC GGCCGATTTC GAGCGCGCGG AGCTGGACGA CTGCAGTCTG GTGGAGACGC GCTCGCCCGA TGCCCGTTTC GTCGGGGCCG CCCTGAGTGC GTGCTACATG GTGCTCGGCA GCTCCCTCGA GGCGGCGGAT TTCGGCGGCG CGCGGCTGGC CGAAAGCAAC CTGCGCGGGG TGGATTTGTC CGGTGCGCGC TTCTGCGGGA CGCGCCTGGC CGATTGCGAT CTGTCGGAGG CCCGCCTGGC GGGCGCGGAT CTGCGCCGGG CGACGGCCAG CGGCTGCCTG TTCAGCGGCG CCGACCTGCG TACCGCCCGG CTCGGAGACG CCCATCTGAT GCAGTGTCTG CTGCGCCGCG CGGATCTGCG CGGCGCCGAT CTGCGCGGCG CTTCGCTGTT CGGCAGCGAT CTGGCCGAGG TGCATCTGGA CGAGGACAGC CTGCTGGACG AGACGGATTT CGGCCGGGTC GCTTTCCACC CGCGCCGCCG CTCGGAGGCC GCGTCGTGA
|
Protein sequence | MEIIKPLRLG VLHRTYHWRH GHRLAVTAIA LATLEESPVL LPEQELWALL DEALDENEQI DLLMPKPCPE FLVNGHAYNV HGTDRRRCRV QARLDDRCKA LVVHGDRYWA EGEAGEPADF EAMPLGRRRA YGGPGYEANP IGIGHVPESV DGGERWPLPN VEHDGQPPFA PGQPPGTPAG FGMRDIDAPA HRAKLGQYDE ETLERDGPGL AESFDWRFFN LAPDDQQWPD RDRLAGGLEY EFLNLHPELA RLAGHLPDGL ARCFVMRQPA EGESALEEIP LRLTTAWFFP DRRRVALIHH GDLAVDDEQA SDIQYLMPAF EAGAAPRGLE HYAEALARRL DEEEGDLYAF DEAALIHEPF IGAGFDTEPL DQGPSDPLMD NLLRRQEEAM RNERERMNSL GLDPQRLAGL APAGDDEELR LQRLADLPRV GRNIRRKEAE LEAQAERERA AALERLRDQE PTTANRELLA QLENPDRELP PFDFAARSAQ LREVYRMDFS PASADLPSRE ESERRLRQQY RDSVHCCGAA PALQGAAAEA LRRKVAQAYA RDRDLAGMDL TGADLSGMDL SGARLTGVLL ESANLADARL DGADLREAIL ARARLDGASL RGADCRGANL SRAQARNACF SGATFGDGQC WEARFETCDF SHARFGGILW QGCELDACRF DGAVLEDLSL HACRLSRPSF VGASLSAVTW VESSLEAADF ERAELDDCSL VETRSPDARF VGAALSACYM VLGSSLEAAD FGGARLAESN LRGVDLSGAR FCGTRLADCD LSEARLAGAD LRRATASGCL FSGADLRTAR LGDAHLMQCL LRRADLRGAD LRGASLFGSD LAEVHLDEDS LLDETDFGRV AFHPRRRSEA AS
|
| |