Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_05900 |
Symbol | |
ID | 7759546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 569243 |
End bp | 570388 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643803510 |
Product | transposase IS891/IS1136/IS1341 |
Protein accession | YP_002797818 |
Protein GI | 226942745 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.542736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAC GCGCCTACAA GTACCGCTTC TACCCGACTC CAGAACAAGC AGAACTGCTT GCCAGGACGT TCGGTTGCGT GCGTTCCGTC TACAATCGCA TCCTGCGCTG GCGTACCGAT GCCTTCTACC AGGAGCAGAA GAAGATCGGC TATACGGCGG CCAGCAGTCG CCTGACCGCG CTCAAGAAGC AACCGGAGCT GGCTTTTCTC AATGAAGTCA GTGCGGTGCC ATTGCAGCAG TGCCTTCGCC ACCAGCAGGC TGCGTTCAAG AACTTTTTCG AGGGTCGAGC GAAGTACCCG GTCTTCAAGA AGAAACGGCA CCGGCAGTCC GCCGAGTTCA CCAGCTCGGC CTTCCGCTAC CGAGACGGCA AGCTGTTCCT GGCCAAGTGC GACGAGCCCC TGGCGATCCG CTGGAGTCGG CCACTTCCTG GTGAGCCTTC CACGGTCACG ATTTCCCGGG ACTCTGCAGG GCGGTACTTC GTCTCCTGCC TGTGCGAGTT CGAACCCGAG GCACTGCCCG TCACGCCGAA GACGATCGGC ATCGACATGG GCATCAAAGA CCTGTTCGTC ACCAGCGAGG GCGAACGGAT CGGCAATCCC CGCCATACGG CCAAATACGC CACCCGTCTG GCTAGGGCAC AGCGTCGACT GAGCAAGAAG AAACTCGGCT CGGAGAACCG CGCCAAGGCC CGACTGAAAG TGGCCCGTAT TCACGCCAAA ATTTCCGATT GCCGAGCGGA CAGCTTGCAC AAGCTGTCCC GCAGACTGAT TAACGAGAAC CAAGTGGTCT GCGCTGAAAC CCTTGCCGTG AAGAATATGA TCCGCAATCC GAAACTGAGC AAAGCCATTG CCGATGCGGG ATGGGGCGAA TTGACGCGCC AGATCCAGTA CAAAGGTGAA TGGGCCGGTC GGCAGATCGT CCAGATCGAC CGCTGGTATC CCTGCTCGAA ACGCTGTGCC TGCTGCGGGC ATATCCTTGA GCGCCTGCCG CTGGATGTTC GCCGCTGGAG TTGCCCGGAA TGCGGAACCG AGCATGACCG CGACGTGAAC GCCGCGATCA ACATTAAAGC CGCCGGGCTG GCGGTGTTAG CCCTTGGAGA GAACGTAAGC GGCATGGGTC AAGTATCCGT GTCCTGTTCT CAGTGA
|
Protein sequence | MTKRAYKYRF YPTPEQAELL ARTFGCVRSV YNRILRWRTD AFYQEQKKIG YTAASSRLTA LKKQPELAFL NEVSAVPLQQ CLRHQQAAFK NFFEGRAKYP VFKKKRHRQS AEFTSSAFRY RDGKLFLAKC DEPLAIRWSR PLPGEPSTVT ISRDSAGRYF VSCLCEFEPE ALPVTPKTIG IDMGIKDLFV TSEGERIGNP RHTAKYATRL ARAQRRLSKK KLGSENRAKA RLKVARIHAK ISDCRADSLH KLSRRLINEN QVVCAETLAV KNMIRNPKLS KAIADAGWGE LTRQIQYKGE WAGRQIVQID RWYPCSKRCA CCGHILERLP LDVRRWSCPE CGTEHDRDVN AAINIKAAGL AVLALGENVS GMGQVSVSCS Q
|
| |