Gene Information |
Locus tag | Avin_20150 |
Symbol | |
ID | 7760943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2003956 |
End bp | 2005962 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804912 |
Product | hypothetical protein |
Protein accession | YP_002799195 |
Protein GI | 226944122 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
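The coordinate and length fields above agree with each other. The sketch below checks this, assuming 1-based inclusive coordinates and a gene length that counts the terminal stop codon while the protein length does not. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Avin_20150.
length_bp = gene_length(2003956, 2005962)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2007 668
```

Both results match the "Gene Length" and "Protein Length" fields listed in the record.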
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.627215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGACAG ACCTTCCGCT CCCATCCGCG ACTTCCGCCC TGCCCCCGGT GCTCGCCGGC CCGCTGCTGC GCCGGCTGAG CCCGGAGCGC CTGATCCTCT GGCTGGTCGG TTCGCGGCCT CTGCAGTTGT CGCTGCAACT GGCCTGCGCC GGCTCGCAGC AGCGCCTCAT CGCGCTCGAC GCGCCGCTCT GCCGCCGCCT GCGCGTCGGC ACCCACGCCT GGCTGCATCT GATAGACGTG CCGCTGGACA CACCGCTGCC CACGGACACC CCGATCGAAT ACGACCTGCT GCTGGAGGAG GACGGCCGCC GGAGCGGCAT CGCCGACTGG GCGCCGCATC TTCTGCACGA GGGTGCCAGG CGGCCGGATT TCCTGCTCAA GTCGCATCTC GACAACCTGC TGCACGGCTC CTGCCGCAAA CCGCACTTCC CCGGCGCGGA CGGCCTGGCC AAGGTCGACA GCCTGCTCGC CGGCCTGCGC GAGCGGCCGC AGGCGCGGCC GGCGCTGCTG ATGCTGAGCG GCGACCAGAT CTACGCGGAC GATGTCGCCG GCCCGATGCT GGCGGCCATC CATGCGTTGA TCCGTCGCCT CGGCCTCTAC GGCGAACGGC TCGAAGGCGC CCTGGTCGCC GACAGCGACG CGCTCTACGC CCATCCCCTG ACCTATTACC GGCGCGAGGA CCTGCTGCCC GCCTTCAAGT CCAGCGAGGC CCTGCGCGAA CGCTTCTTCG GCGGCGTGGA AAAACCGGTG TTCACCAGCG CCAACGCCCA CAACCACCTG GTCACCCTCG CCGAGGTGCT GGCGATGTAC CTGCTGGTCT GGTCGCCGCT GCCCTGGACG CTGATCCGCC CCGCGCGACC GGCGCTGGAT GCCGACCTCG CCGAGCGCTA CGCCACGGAA CAGGCCTGTC TCGACGGCTT CCTCGCCCAG TTGCCGCAGG CCGCCCGCGC CCTGGCGCAT CTGCCGTGCC TGATGATCTT CGACGACCAC GACGTCACCG ACGACTGGAA CCTGTCCGCG CGCTGGGAGG AAACCGCCTA CGGCCACCCC TTCTCCCGGC GCATCATCGG CAACGCGCTG ATCGCCTATG CCCTGTGCCA GGGCTGGGGC AACGACCCGG ATGCGTTCGG CGAGATCCTG CAGCGGATCG AGGCGCTGAC CGCGGACGCC GACGAGCGCC TCGACGACGC AGCCCAGGAC GGGCTGATCG ACGAGTTGCT GAAATTCCAG AGCTGGCACT ACCGGCTACC GACCACGCCG CCCCTGATCG TGCTGGACAC CCGCACCCGC CGCTGGCGCA GCGAGAGCCA CCCGAGCCGG CCATCCGGAC TGATGGACTG GGAAGCCCTC AGCGAACTGC AGCAGGAACT GCTCGACGAG CAGGCCGCGG TGATCGTCTC GCCGGCGCCG ATCTTCGGGG TCAAGCTGAT CGAGGCCGTC CAGCGCCTGT TCACCCTGGC CGGCCACCCG CTGCTGGTCG ACGCGGAGAA CTGGATGGCC CATCGCGGCT CGGCGAAAGT GATCCTCAAC ATCTTCCGCC ACTCGCGCAC CCCCGCCAAT TACGTGATCC TCTCCGGCGA CGTGCATTAT TCCTTCGCCT ACGACGTGCG CATCCGTCAC CGCAGCGGCG GACCGCAGAT CTGGCAGATC ACCAGCAGCG GCATCAAGAA CGAATTTCCC GCCGGCCTGC TCACCTGTTT CGACCGGCTC AATCGCTGGC TCTACACACC CTGGTCGCCG CTCAACTGGC TGACCAAGCG GCGCCGGATG GAGGTGGTGC CGCGCATTCC CCATCGCGGC 
CGGCGCGGCG AGCGCCTGTG GAACGGCACC GGCATCGGCC AGTTGCTGCT CGACGTCCTG GGCCGTCCCC GCAGCATCCT CCAGCACAAT GCCGACGGCT CGGCGCCGAC CCGCTTCGTC CGCTCGGCGG AACTGCAGCG GGCGTCCGGC CGGACCGAGG ATGGCATCGT CGAAAGCGCC CGGCTGCGTC GCGGCAAGGC GGAGTGA
|
Protein sequence | MPTDLPLPSA TSALPPVLAG PLLRRLSPER LILWLVGSRP LQLSLQLACA GSQQRLIALD APLCRRLRVG THAWLHLIDV PLDTPLPTDT PIEYDLLLEE DGRRSGIADW APHLLHEGAR RPDFLLKSHL DNLLHGSCRK PHFPGADGLA KVDSLLAGLR ERPQARPALL MLSGDQIYAD DVAGPMLAAI HALIRRLGLY GERLEGALVA DSDALYAHPL TYYRREDLLP AFKSSEALRE RFFGGVEKPV FTSANAHNHL VTLAEVLAMY LLVWSPLPWT LIRPARPALD ADLAERYATE QACLDGFLAQ LPQAARALAH LPCLMIFDDH DVTDDWNLSA RWEETAYGHP FSRRIIGNAL IAYALCQGWG NDPDAFGEIL QRIEALTADA DERLDDAAQD GLIDELLKFQ SWHYRLPTTP PLIVLDTRTR RWRSESHPSR PSGLMDWEAL SELQQELLDE QAAVIVSPAP IFGVKLIEAV QRLFTLAGHP LLVDAENWMA HRGSAKVILN IFRHSRTPAN YVILSGDVHY SFAYDVRIRH RSGGPQIWQI TSSGIKNEFP AGLLTCFDRL NRWLYTPWSP LNWLTKRRRM EVVPRIPHRG RRGERLWNGT GIGQLLLDVL GRPRSILQHN ADGSAPTRFV RSAELQRASG RTEDGIVESA RLRRGKAE
|
| |
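The gene sequence above is shown in coding orientation (it begins with ATG and ends with TGA) and is grouped into space-separated blocks of 10 bases. The sketch below shows one way to strip that grouping, compute GC content, and run a basic CDS sanity check. The helper names are hypothetical. The start codon set is limited to the three most common bacterial starts, which is a simplifying assumption rather than the full translation table 11 list. The demo string is a toy built from the record's first three codons plus its stop codon, not the full gene.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts only
STOPS = {"TAA", "TAG", "TGA"}          # stop codons under translation table 11


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    seq = clean(seq)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOPS
    )


toy = "ATGCCGACA TGA"  # toy example, not the full record sequence
print(looks_like_cds(toy), gc_fraction(toy))  # prints: True 0.5
```

Passing the full "Gene sequence" field to `gc_fraction` can be used to cross-check the "GC content" value reported in the record.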