Gene Information |
Locus tag | Avin_45820 |
Symbol | |
ID | 7764190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4651655 |
End bp | 4653022 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643807427 |
Product | group II intron maturase |
Protein accession | YP_002801668 |
Protein GI | 226946595 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
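The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Avin_45820 record
length_bp = gene_length(4651655, 4653022)
print(length_bp)                  # 1368
print(protein_length(length_bp))  # 455
```

Both results agree with the Gene Length (1368 bp) and Protein Length (455 aa) fields.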
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATGT CGGAGGCCAT GCACCAGAAG CCCGCGTGCG CGGGGCGGCA TGCCGCAGGC CAGGGTGAAG CCCCGGCCAA GGTATCCCGT GGTGAAGCCG AAGGCCCGCG ACATGAGATG GAAGGCACAG GGTCGGCGCT GCTGGAAGCG GCGCTGACGC GAGAGAACCT GCGGCAAGCG TTCAAGCGGG TGCGAGCCAA CCGGGGATCG GCGGGCGTGG ACGGTCTGGA CATCGACCAG ACGGCGCGCA AGCTGGTGAC CGAGTGGCCT GCGATCCGGG AGCAATTGCT GCGGGGGACG TACCGGCCCA GTCCGGTACG GCGGGTGATG ATTCCGAAGC CGGATGGGAG CCAACGAGAA TTGGGTATTC CGACGGTGAC GGATAGACTG ATCCAGCAGG CGTTGTTGCA AGTGCTGCAA CTGCTGCTTG ATCCGAGCTT CAGTGAGCAC AGCTACGGGT TCAGGCCCGG AAGACGGGCG CATGACGCGG TGTTGGCCGC GCAATCGTTC GTGCAGTCGG GCCGGCGGAT AGTGGTGGAC GTGGACCTGG AGAAATTCTT CGACCGGGTC AACCACGACA TTCTGATCGA CCGCCTACGC AAACGCATCG ACGACGCGGG AGTCATCCGG CTGATTCGTG CGTACCTGAG CGCGGGGATC ATGGATGGCG GGGTGGTCAT CGAGCGGGAC CAGGGGACGC CGCAAGGCGG GCCGCTGTCG CCGCTGTTGG CCAACGTCCT GCTCGACGAG GTGGACCGGG CGCTGGAGCG GCGGGGCCAT TGCTTCGTGC GCTACGCCGA TGACTGCAAC GTGTACGTAC GCAGTCGGCG GGCGGGCGAG CGGGTGATGA ATCTGCTGCG CAAGCTGTAC GGCCGGCTCA AGCTGAGGGT CAACGAAGCC AAGAGCGCGG TGGCCAGTGC GTTCGGCCGC AAGTTCCTGG GGTATGCCTT CTGGGCAGCG CCGAAGGGAC AGGTCAAGCG CAAAGTGGCG GCCAAGCCGT TGGCGACGTT CAAGCAGCGG ATCAGGCAAC TGACGCGGCG CAGCGGTGGG CGCAGCATGG CGCAAGTCGT GCAGGAGCTG CGTCCGTATG TGCTGGGCTG GAAGGCTTAC TTCGGACTGT CGCAGACACC GAGAGTCTGG CGTTCGCTGG GCGAATGGCT GCGGCATCGG TTGCGTGCCG TCCAGCTCAA ACAGTGGAAA CGCGGCAAGA CCCTGTTTCG GGAACTGCGC GCCCTGGGGG CCAGCCACGA GGTGGCGCAA CGGATCGCGG CCAACAGCCG CCGATGGTGG CGCAACAGCG GCAAGCTCCT GAACAGCGTG CTCAACCTGG CGTGGTTCGA CCGGCTGGGC CTACCCCGAC TCGCCTGA
|
Protein sequence | MSMSEAMHQK PACAGRHAAG QGEAPAKVSR GEAEGPRHEM EGTGSALLEA ALTRENLRQA FKRVRANRGS AGVDGLDIDQ TARKLVTEWP AIREQLLRGT YRPSPVRRVM IPKPDGSQRE LGIPTVTDRL IQQALLQVLQ LLLDPSFSEH SYGFRPGRRA HDAVLAAQSF VQSGRRIVVD VDLEKFFDRV NHDILIDRLR KRIDDAGVIR LIRAYLSAGI MDGGVVIERD QGTPQGGPLS PLLANVLLDE VDRALERRGH CFVRYADDCN VYVRSRRAGE RVMNLLRKLY GRLKLRVNEA KSAVASAFGR KFLGYAFWAA PKGQVKRKVA AKPLATFKQR IRQLTRRSGG RSMAQVVQEL RPYVLGWKAY FGLSQTPRVW RSLGEWLRHR LRAVQLKQWK RGKTLFRELR ALGASHEVAQ RIAANSRRWW RNSGKLLNSV LNLAWFDRLG LPRLA
|
| |
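The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a minimal sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for internal codons. Alternative start codons, which table 11 also permits, are not handled here. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for internal codons (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop.

    Assumes only A, C, G and T are present; any other symbol raises KeyError.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("ATGTCGATGT CGGAGGCCAT GCACCAGAAG"))  # MSMSEAMHQK
```

The printed residues match the first ten residues of the protein sequence field. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.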