Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50760 |
Symbol | |
ID | 7763924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5142608 |
End bp | 5146426 |
Gene Length | 3819 bp |
Protein Length | 1272 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643807904 |
Product | hypothetical protein |
Protein accession | YP_002802138 |
Protein GI | 226947065 |
COG category | [S] Function unknown |
COG ID | [COG3523] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGC CGGAAATCGA CGTGGCCGTG ATTGTCGCGC TCCTGCTGTT GCTGGCGGCC GCGCTGTTGC TCGCCTGGCT GCGCAGCCAG GCGGGCTCGG CGGTGCGCGG CTTCTATGCC GCGGTTCGCC AGATGGAGCA CGACCGGGCG GTGAACGATC GCTACCAGAC GCCCTGGCTG CTGATGATCG GCGACGAGGA ACGCTGCGCG CAGCTATGCC TGGACTGGCA GTTGAAGGCG GCCGGCAAGC CCGCCTGGTT CGGTTGCTGG TGGAGCGACC CGGATGGCGC GCTGTTGGTC GTGCCGGATG CCCTGTGCAT GCCGGAGGAA GGCCGGCCGG GCCGCTCCGC CGCCTGGTGG CGGGTGCTCG GCCTGCTGCT GCGCCTGCGC TCCAGCCGCC CCCTGGATGC GCTGGTCTGG GTCGTGCCGG TCGCCGTCCT GCTGGATGGC GAGCAGGCCG TGGCCCGCGG AATCGCCGCG CGCCGTGCTT TCATCGAGCT GCTGCAGCGC CTGGGACTCA GCCTGCCGGT CTATGTGCTG GTGACCGGCA TGGAGGAAGT GCCCGGCTTC CAGGAACTGA TCGCGGCGCT GCCCGAGGAG GCGCGCGAGA CACCGCTCGG CTGGTCCTCG CCTTTCGCGC CGGAGGCCGC CTGGCAGTCG CACTGGAGCG ATCTGGCTCT CGATCGGCTG GGGCGCGCGC TGTCCGAGGC GATCGTCGAG CTCGGCGCGC TGTCGGGACG GCTCGGCGAG GCCCTGTATT GCCTGCCGCA GCGCTTCGAG GACATGCGCG GCGGCCTGCA CGCGCTGCTC GACCCGGTGT TCCAGGGCAA CGCCCTGGGC GAGGCGCCGC GCTTGCGCGG CCTGTACTTC GTCGCCGGCC GGGCGGCAGC GGGCGGCGAA CCGGGCCAGG ACCTGTCGGG CCTCGACGTG CCGCCGCTGC GTGGCCTGTT CTCCCGTCAG CTCTGGCAGC GCCGCATCCT CGCCGAGCAG GGGCTGGCCC GGGCGGTGCC GCGCATCCTC CGCCTGCGCC AGCGCTGGCA GCAGGTCGCC GGTGCCGCCG CCCTGGTGTT CGGCCTGTTC TGGAGCGGAG CGATGCTGTG GGTCTGGTAC GAGTCAAGCC GCGAGGCCGA GGATCTGGCC AGGCTGCTGC AGGAAACCCG CAACCGCTAC GTGGCGCTCG ACGACGACGC CCGCCACCGG GAGCTGAGCC GGCAGAACGC CCAGGCCTTC TGGACCCTGC TGGAACGCGC GCCGCGCTGG CGTTTCGCGT CTCCGGCCTT TCCGACCTCC TGGCTCTCGC CGCTGGACCG TCGGCTGGAG GCAAGCCTGG CGGGCGCTGC GCGGGAGCGG CTGTTCCAGC CGCTGCGCGA CCAGTTGGCG GCCGATCTGG ATGCACTGGG TGCGATGCAC GGTGCCGGGC GGCATTCCAG CCCCGAGGGC GACAGCCCCG AGCAGTGGCA GAACTATGTC CTGGCCAAGG GCCTGGTCGA GCGGGTCACG CGCCTCGAAC AGCAGAACCG CTGGCTCGCC CAGGCCCTGG GCAGGCCCCT GGCGCCCCTG GAAACCCTCG CCGCCATGGG CCGCGATGCG CTTGGCCTGC AGCCGGGCGC CGGGCCGCTG CGCCACGAGG CCTTCTACAA CAGGCTCCTG CGCACGGCGC CCGCGCCGGC GATGCAGGCG CTGGACCTGC ACGCGAACCG GAAGATCGCC GAACGTTTCC AGAACCTGAT GCGGCAGTGG CTGACCCAGT ATTTCCTGGC CGAACACTTC GTCCGGCCGG CCGGCTACCT GAAGATTCAC CTCCAGAAGC TGCAGGCCGG CTACGGCAAC TCCCTGCGCG AACTCGAGGA GGTCGGCGGG CTGATCGACA ACCTGCGGGA TCTCGTCGCC CTGATCAACG CGGCCTGGGC GCGCGGCAAC GGGCGGGACC GGGTGCCGGG CTACGAGGCG ATGCTCGATG GCGCGCGCCA GACCACCCTG CTCGGGCCGG CGGTGGTACA GGCCGTCGAG CAGGAGGCCT CGCGCCTGCA GAAGAGCTTC CGCGACCAGT GGATCGCCCA GGCGGGCTCG CGCGACAACC TGCTGGTGCT GCTGGGCGGC GGCAGTCTGG AACTGCAGGA GCAGGTCGCC GGGCTGGATA AGTCCATCGA AAGCCTGCTA CGGCACGATT TCGCCGCCAT CGCCTTGCGC CGCGAGGGCG AGGGCGAGGG CGAGGATGAG TCCCGCGCCC AGCCCCTGGC CATCGATGCC CGCGCGCTCG ACACCGCGCT GGGCTACTAC GCCGGCTACA AGGGTTATGC CGACCAGGAG CTGGCGCAGA TACCGCCGCC GTACCGTGCG GCCCTGCTCG GAGCGGCCGC ACGCGCGGCT TCGCTGGCCA TGTGGAGCAG CCTGGAGTCG AAGGGGGGCG TGCCGCGCCT GGGCAGCGAG CGTTCCTTCG ACGTGCCGGC GGACAAGGCC CTGCAACTGC AGGAGGCTTT CGCCGAGCTG AACAGTGGCG ATCTGGCCCA GGCTTTGCTG GAAAACCTCA ACCGCCGGGC CCTGGGCGAC GTGCAAGAAG CGCTGGCGGA GGTCGAGGCG CTGCCCATGT TCCACCAGGG CTACGACTTC GCTCTGTGGG ATGGCGGCAA GGGACCGGGC CTGCAGATGT TCCGCGCCCA GGATCTGCAG GACCTCAAGC AGGGGCTGGC CCAGCAGTTC GCGGTCATGC TGGAAGCCAC CCAGGCCAGC GCCCGGGCGC TGGAATGGCT GCGGATGCAG CGCTCGCACC TGTCGCTGGC CGACTACGAC AAGGTCCTGC GGCTGACCGC GCTGGGCGAG GAGATGCTCA AGCACAAGGC GCAGAACCCG GCCAGCGCGC CGGCGCTGTT CCAGCAACTG GTCGCCCGCG ACTTCGTCGA GATGGACGGC GGAACCTGCC CGGGCATCCT CCAGGCCGCC TATCTGGCGC AGGGACAGGA TGACCTGTCG CGGCGCGGGC AGTTCCTCTG GGAGGAGGCG CGGCAGCGCT GCGACCTCCT CCAGCAACAG CGCGCGGCCA CCGCCTGGAG CCGCCTGGCC GACTATTTCA ACCAGTATCT GGCCGAGCGC TTCCCGTTCT CCCACGATAT CCGTGCCGTC GATGTCGATC CCGAACGGGT TCGCCACCTG CTGCGGCTGA TCGACGAGCA CCTGCCCCAG GCCGAGGCGG GCCTGCAACT CGCCCGCCGG TCCGACCGGC AGGCCGCGCA GGACTTTCTG CTGCGTCTGA AGCAGGCCTC CGGTTGGCTG GGAGCGTTGC TGCTGCGCGA CAAGAACGGT CTGCAGGGGC TCGACGCGGA GGTGCGCTGG CGCACCGACC GCGCGGACGA GCGCGGCGCC GACCAGGTGA TCGCCTGGCT GCTGCAGGCG GGCCGCCAGG AGATCGCCTA TCCGGGCGAC GATCCGCAGC GCCTGCGCTG GACCCAGGGC GAACCGGTCC GCCTGGTGCT GCGCTGGGCG AAGAACGGCT CGCAGCGGCC GGTGAACGAT CCGCTGCAGC CGAGCCTGGC GGTCGCCGAC CGCGAGGCCG GCTGGGAATA CACCGGTCCC TGGGCGCTGC TGCGCCTGAT GCGTACCCAC ATCGCCGTGC AGCGGCAGTC GTACGTCGAC TACACCGACT TCCCGCTGAC CTTCCAGTTG CCGGTCTACG GGGCTTACAG CAGCGACAGC CGGGCGATCC TGTTCATGCG CGTGTCGCTG ATGACCCAGG GCGGCAAGGC GCCGCTCTCC ATCCAGCCGC TGCCGGTACG TGCCCCCCGT TCCCCTTTCG CCGCGCCGTC CGTGCCGCAG CGCTGGTGA
|
Protein sequence | MSLPEIDVAV IVALLLLLAA ALLLAWLRSQ AGSAVRGFYA AVRQMEHDRA VNDRYQTPWL LMIGDEERCA QLCLDWQLKA AGKPAWFGCW WSDPDGALLV VPDALCMPEE GRPGRSAAWW RVLGLLLRLR SSRPLDALVW VVPVAVLLDG EQAVARGIAA RRAFIELLQR LGLSLPVYVL VTGMEEVPGF QELIAALPEE ARETPLGWSS PFAPEAAWQS HWSDLALDRL GRALSEAIVE LGALSGRLGE ALYCLPQRFE DMRGGLHALL DPVFQGNALG EAPRLRGLYF VAGRAAAGGE PGQDLSGLDV PPLRGLFSRQ LWQRRILAEQ GLARAVPRIL RLRQRWQQVA GAAALVFGLF WSGAMLWVWY ESSREAEDLA RLLQETRNRY VALDDDARHR ELSRQNAQAF WTLLERAPRW RFASPAFPTS WLSPLDRRLE ASLAGAARER LFQPLRDQLA ADLDALGAMH GAGRHSSPEG DSPEQWQNYV LAKGLVERVT RLEQQNRWLA QALGRPLAPL ETLAAMGRDA LGLQPGAGPL RHEAFYNRLL RTAPAPAMQA LDLHANRKIA ERFQNLMRQW LTQYFLAEHF VRPAGYLKIH LQKLQAGYGN SLRELEEVGG LIDNLRDLVA LINAAWARGN GRDRVPGYEA MLDGARQTTL LGPAVVQAVE QEASRLQKSF RDQWIAQAGS RDNLLVLLGG GSLELQEQVA GLDKSIESLL RHDFAAIALR REGEGEGEDE SRAQPLAIDA RALDTALGYY AGYKGYADQE LAQIPPPYRA ALLGAAARAA SLAMWSSLES KGGVPRLGSE RSFDVPADKA LQLQEAFAEL NSGDLAQALL ENLNRRALGD VQEALAEVEA LPMFHQGYDF ALWDGGKGPG LQMFRAQDLQ DLKQGLAQQF AVMLEATQAS ARALEWLRMQ RSHLSLADYD KVLRLTALGE EMLKHKAQNP ASAPALFQQL VARDFVEMDG GTCPGILQAA YLAQGQDDLS RRGQFLWEEA RQRCDLLQQQ RAATAWSRLA DYFNQYLAER FPFSHDIRAV DVDPERVRHL LRLIDEHLPQ AEAGLQLARR SDRQAAQDFL LRLKQASGWL GALLLRDKNG LQGLDAEVRW RTDRADERGA DQVIAWLLQA GRQEIAYPGD DPQRLRWTQG EPVRLVLRWA KNGSQRPVND PLQPSLAVAD REAGWEYTGP WALLRLMRTH IAVQRQSYVD YTDFPLTFQL PVYGAYSSDS RAILFMRVSL MTQGGKAPLS IQPLPVRAPR SPFAAPSVPQ RW
|
| |