Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49810 |
Symbol | |
ID | 7763833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5046214 |
End bp | 5049153 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643807813 |
Product | hypothetical protein |
Protein accession | YP_002802047 |
Protein GI | 226946974 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGCGCTG TGCTCTGGTT CGAGGGCACA AGCTCCTCGG CGTCGGATCT GAAAAAACTT TGGCTGTTAA GCGAGTCCCC CGAGCAGACT CTGGAGCATT GGCTCAACCA GCATCACGCA GGAGGCATGC GGGCTCTGCG TCAGATGGAG AAGGATGGGA CGGGTATCTG GCCTTATCCC AGCAAGAAAG CCTTCTTGGC CAGGCTGAGG GATTACTTCG AGGTCGGTGA AAACGCATTC ACCTTGCTCA ACCGCGCTGC CGGCCTGAAG CAGCTCAATA GTATTGATGA GATCTTTCGA GAGTTGGTGC TGGATGACCG CTCTGCCTTC GAGCGGGCTG CAGAGGTCGC CAGCAGCTTC GATGATCTGA CAGACATTCA CCGAGAGCTG GAAACCGCTA GAAAGCAGCA GCGCTCATTG CAACCGGTCG CCGATAGCTG GGAGCGTTAT AAGGCCTTGC AAGAACAGTT GCAGGATAAA CAGGCTCTCG AAGGCATCCT CCCGGTCTGG TTCGCCGAGC AGGGGTATCG CCTGTGGTTG GCTGAAACCA GGAGGCTCGA GAAGGAACAC AAACAGGCCG AACTGGATCA GGAGCAATGC AGGAGTCAGC TTGAGGTCCA GAAAGGTGTG GTCGATCAAC ATCGTCAGCG CTACCTGCGA GTAGGTGGCG CAGGGATAGA TCAATTACGC GAGCGCATCG CTGATTGGGT CAGGGAGTGC GATAAGCGCA GCCTGAAAGC CGAGCAATAC CGACGCTTGG CCAAAGGGCT TGGACTAGCG GATGAGTTAT CGGCCGCTGC TCTCAAGGAG AATCAGCAGC AGATCGCGGC ACGCCTGGAA ATACTGGCCC AACAAATTAC TGACGCGCGC CAAAAGGCGT TCGACGCTGG TCTCGTCCAG CAGGGGCTCA ATGGTCGCCT ACAGACCTTA CAGCAGGAAC GTGCTGAGGT TGAGAGACGG CCAGGCTCGA ACCTGCCCGG GCAATTTCAT GCGTTTCGAA GCGATCTGGC GCAGGCGCTT GACGTCGATG AGTCGGCCCT GCCTTTCGTT GCCGAGCTGG TGCAGGTCAA GCCCGAGGAA CTGGCCTGGC GGGGCGCCAT TGAGCGGGCA GTCGGTAGCC ACCGACTACG TATCCTGGTG CCGCAGGGCT CATCCCAGGC CGCACTGCGT TGGGTGAACC AACGGCACAA CCGCTTGCAT GTGCGCTTGT TGGAGGTAAA GGAGCCGTCT TCGCGCCCCG TGTTCTTCGA CGATGGTTTT ACCCGCAAGC TGACTTTCAA GGAGCATCCG TACCGAGAAA CCGTAAAGGC ACTACTGGCG GACAATGACC GCCACTGCGT CGAAAGCACG GAGCAATTGC GTCATACCCC TCACTCCATG ACGGCCCAGG GGCTGATGTC GGGCAAAGAA CGCTTCTTCG ACAAGCAGGA CCAGAAGCGC CTGGACGAAG ACTGGCTGAC AGGCTTCGAC AACCGCGACC GTCTGGCCTT CTTGGCTGAG CAGATTCGCG AGGTCAATGA ACAGCTCGCG CCTGCCAAGC TGGCCTTGGA TGCTGCCCAG GACGATGCCC GTCAGTTGGA GACTCAAGCT TCGCTGCTGA AACGCGTGGA AGAACTGCAG TTCGAGGATA TTGATCTGCC GGGAGCCCAG AGTCAGCTGG AGTCGCTGCG TACGCAACTG ACCACCCTGA CGCGCCCCGA TTCTGACCTT GCCATGATCA AGTCCGAACT GGATAAAGCC GAGGGCCTGC AGGACTCTCT GGATCAACAG CTCCGCCAAC TGATCGAGCA GTGCGTTCAG CTAAAAACTC AGTTTGATCA GGCCGCATCC GCTACCCGTA AGGCATATAA CGGCGCGGAG AAAGGGCTGA ACGATACGCA GCGTGAACTG GCGAAAGAGT ACTTCCCTAC TCTGGCCCCA GAAGACCTAG GCGACATTAT CGAGTTGGAG CGCAAGCATA CGCGGGAGCT TCAACAGCAG CTCAAATCGC TCAGCGAAAA ACTGAGTTAT CAGCAAGCAG AGCTTGCTAA ACGGATGTCC GACGCCCTGA AAGTAGACAC CGGTGCGCTT TCAGAGGTCG GTCGGGAGCT GGCGGACATA CCCAAGTATC TGGAGCGTCT GCGTGTGCTG ACCGAGGAGG CCCTACCCGA GAAACTCAAG CGCTTCCTGG AGTATCTCAA TCGCTCCTCA GATGATGGGG TCACCCAGTT GCTCAGCTAT ATCGATCATG AGGTCTCGAT GATCGAGGAA CGCCTAGATG ACCTGAACAG TACAATGCAG CGTGTCGACT TCCAGCCAGG GCGCTATCTA CGTTTGGTTG CAGGCAAGGT CATCCATGAA AGTCTACGCA CGCTGCAACG CGCTCAGCGC CAATTGAATT CGGCGCGCTT CATCGATGAT GAAGGTGAGA GCCACTACAA GGCATTGCAG GAACTCGTAG GGCTACTCAA AGACGCTTGC GAACACAGCA GGACTCAAGG GGCCAAGGCG CTTCTGGATC CGCGTTTCCG CCTGGAGTTC GCGGTCTCGG TGATCGACCG CGAGAGTAAA AATGTCATCG AGACCCGAAC AGGCTCTCAG GGCGGTAGCG GTGGTGAGAA GGAGATCATC GCCTCCTATG TGCTGACCGC TTCCCTCAGC TATGCGCTCT GTCCCGACGG CAGCAGCCGG CCGTTGTTCG GCACCATCGT TCTCGACGAG GCGTTCTCGC GCAGCTCCCA TGCGGTTGCC GGTCGGATCA TCGCGGCGCT GAGGGAATTC GGTCTGCATG CTGTCTTCAT CACGCCCAAC AAGGAGATGC GCCTGTTGCG CCACCATACG CGTTCGGCAG TCGTCGTTCA TCGGCGGGGC GTGGAATCCA GTTTGGTTTC CCTGAGTTGG GAAGCTTTGG ATGAGCATCA TCAACAGCGC ATCAAGGCGA TGCATGAAGT CGCCCACTGA
|
Protein sequence | MGAVLWFEGT SSSASDLKKL WLLSESPEQT LEHWLNQHHA GGMRALRQME KDGTGIWPYP SKKAFLARLR DYFEVGENAF TLLNRAAGLK QLNSIDEIFR ELVLDDRSAF ERAAEVASSF DDLTDIHREL ETARKQQRSL QPVADSWERY KALQEQLQDK QALEGILPVW FAEQGYRLWL AETRRLEKEH KQAELDQEQC RSQLEVQKGV VDQHRQRYLR VGGAGIDQLR ERIADWVREC DKRSLKAEQY RRLAKGLGLA DELSAAALKE NQQQIAARLE ILAQQITDAR QKAFDAGLVQ QGLNGRLQTL QQERAEVERR PGSNLPGQFH AFRSDLAQAL DVDESALPFV AELVQVKPEE LAWRGAIERA VGSHRLRILV PQGSSQAALR WVNQRHNRLH VRLLEVKEPS SRPVFFDDGF TRKLTFKEHP YRETVKALLA DNDRHCVEST EQLRHTPHSM TAQGLMSGKE RFFDKQDQKR LDEDWLTGFD NRDRLAFLAE QIREVNEQLA PAKLALDAAQ DDARQLETQA SLLKRVEELQ FEDIDLPGAQ SQLESLRTQL TTLTRPDSDL AMIKSELDKA EGLQDSLDQQ LRQLIEQCVQ LKTQFDQAAS ATRKAYNGAE KGLNDTQREL AKEYFPTLAP EDLGDIIELE RKHTRELQQQ LKSLSEKLSY QQAELAKRMS DALKVDTGAL SEVGRELADI PKYLERLRVL TEEALPEKLK RFLEYLNRSS DDGVTQLLSY IDHEVSMIEE RLDDLNSTMQ RVDFQPGRYL RLVAGKVIHE SLRTLQRAQR QLNSARFIDD EGESHYKALQ ELVGLLKDAC EHSRTQGAKA LLDPRFRLEF AVSVIDRESK NVIETRTGSQ GGSGGEKEII ASYVLTASLS YALCPDGSSR PLFGTIVLDE AFSRSSHAVA GRIIAALREF GLHAVFITPN KEMRLLRHHT RSAVVVHRRG VESSLVSLSW EALDEHHQQR IKAMHEVAH
|
| |