Gene Information |
Locus tag | Avin_43140 |
Symbol | |
ID | 7764102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4356428 |
End bp | 4358236 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643807169 |
Product | phage integrase-like protein |
Protein accession | YP_002801410 |
Protein GI | 226946337 |
COG category | |
COG ID | |
TIGRFAM ID | |
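
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Avin_43140).
length = gene_length(4356428, 4358236)
print(length, protein_length(length))  # 1809 602
```

Both results match the Gene Length (1809 bp) and Protein Length (602 aa) rows of this record.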
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCCT CCCCCTCCTA CCTGACCAAG AACCGCCACG GAACCTTCTA CTTCCGAATG GTCATTCCGG CCCCGTTGCG CACCCTGATC AACAGCAAGC GCGAAGTCAG ACGCAGCCTC AAGACCGACA GCCGGCGACT TGCCATCAAA CGAGCCCGCC AGTTCGCGGT CAGATACGAA ACGGCATTCG ACAAGGCGAT CAGCAGCATG ACGACTACCA GGGACGGCGA CGACGTTCTA ACCGAGGAAG ACATCAAGCT CTTGGAAGAG CTGGACCTAC CGGCAGCCGG CGCATGGTCG GATCAACCGA GCAACACCCC GCCGGAGCCG ATCCTTACCG ACGAACAGAT TGAGGCCCGA CAACGGCGCC GGGAAGTTGA ACGCCTTCTT GCTGGCGCCT ACGGCCGCGC CATTCCTACC GATCAGGAGC CGCTTGCCTC CCGGCTTCTG GAGCTTTCCA AGCCCTACCA GCCCACAGAG CTGCGGCAGA TACTCCCCAG GCTCCGGGAC GAGCTGATCA AGAGCGCCAT TGCCCCTGCA CCGGCGCCAG CACCGGCTCC GACCTTCGAT CCAGCCATGG CAGACTGGAC CCTGTACCAA GTTTGGCAAC ATCAGCTCGA ACGCGACCGG GCCGACATCG CAGCGACCGG GGGCCAGGCA CGGCACGGGG GCACCCTTGA AGAACGCGAG CGACGCGCCA GGGTTATGAC TGTGCTCACC CAGCACAAGC CTGTATGCCA GCTCTCAAAG CGCGACTGGC AGGCCGCTTA TGACGCAGCC CGCCGCATGA AAGCCGGAGT CACGGTATCC GTTGCCCCAG ACCCTCAAAC TCCGCTCGCC GAGCTTCTAA CGGACGATCC GGCGCTCATG ACCGGGCATG AACGGACAAC CGCCGTCATC GCCTCGATAA AGCAGCTCCA GACCTATGCG CGCTTCTTGG AGCTGACCAC CATCAGCCCG GATGATCTAG ACATTCCACC GATCCAAGAG CGCACGACCG CCGGCAGCCG CTCATCCAAG GCAATTTTCA CGCCATCTGA CCTGGAAAAG ATATTTTCGG GCTGGATCTA CCAGGGCGAC ATCCCCAGGC GAACCAAGGC ATATCCGTTC TGGTACTGGT TGCCACTGGT CGCCTATTTC ACCGGCGCAC GCACCGGCGA AATCACCCAG CTCGACACGG CCGACATCCG GGCTATCAAC GGCCACCCGT GCTTCGATTT TTGTGAGGAC GACCCGAAAG CCTTCGAGGC CAAACGGATC AAGACAGGGG AAGCCCGCCA AGTTCCGATT CATCCTTGCC TAATCGAACT TGGCTTTCTC GACTATGTGG CCAGCCAAGC CCAGGACAGG CAGAAGAAAC TATTCGGCGA CGGGCTGACC TACATGGAGC CCCGGCACGA CACGGACGCC AACAAAGAGG GCTGGACAAA GCGCGCCGGG AAGTTCTTCA ACGAAGCGCC AGACGGCTAT CTGGTTACAA CTGGCGTCCA CCAGCCAAGG GACGGCAAGT CGATTTATTC ATTCCGGCAC ACACTGGTAA CAACTCTCAG GAACGCCGAG CGCGGCGGTC AGGAGCTGAA ACAGACCCTT ATCAACGCCA TTACCGGACA CAGGGAAAAA GACGTGCAAG GCCGTCATTA CGACAACGGC CCAACTATCG AACTCAAACT TGACGCGCTC TTGTTGATGC CGGTCCCGGA AGCTATCCAG CGGCTCAAGG GCTACAAACC TGACTTCGTG GACCGCTTCG GCGACACTCT GACCAAGAGT ATCGCTAGCC ACCGTCGCAA GTACCCACGC 
ACGATATGA
|
Protein sequence | MKPSPSYLTK NRHGTFYFRM VIPAPLRTLI NSKREVRRSL KTDSRRLAIK RARQFAVRYE TAFDKAISSM TTTRDGDDVL TEEDIKLLEE LDLPAAGAWS DQPSNTPPEP ILTDEQIEAR QRRREVERLL AGAYGRAIPT DQEPLASRLL ELSKPYQPTE LRQILPRLRD ELIKSAIAPA PAPAPAPTFD PAMADWTLYQ VWQHQLERDR ADIAATGGQA RHGGTLEERE RRARVMTVLT QHKPVCQLSK RDWQAAYDAA RRMKAGVTVS VAPDPQTPLA ELLTDDPALM TGHERTTAVI ASIKQLQTYA RFLELTTISP DDLDIPPIQE RTTAGSRSSK AIFTPSDLEK IFSGWIYQGD IPRRTKAYPF WYWLPLVAYF TGARTGEITQ LDTADIRAIN GHPCFDFCED DPKAFEAKRI KTGEARQVPI HPCLIELGFL DYVASQAQDR QKKLFGDGLT YMEPRHDTDA NKEGWTKRAG KFFNEAPDGY LVTTGVHQPR DGKSIYSFRH TLVTTLRNAE RGGQELKQTL INAITGHREK DVQGRHYDNG PTIELKLDAL LLMPVPEAIQ RLKGYKPDFV DRFGDTLTKS IASHRRKYPR TI
|
| |