Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3429 |
Symbol | |
ID | 5671800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4059514 |
End bp | 4060779 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242317 |
Product | transposase IS4 family protein |
Protein accession | YP_001507737 |
Protein GI | 158315229 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGAGAC GCGGTCAGGT CAAGGAGAAG CCCGAGGACC GGTTGGTGGA CCGGGTCGGG TTGGGGGTGC TGGCGGCGCA GTTCCCGGAC GCGCTGGTCG ACCGGGTGGT CGCTGAGACC GGGCGGCGTG AGCGGCGGAC GCGGGACCTG CCGGCGGCGT TGACCCTGCG GTATGTGGTG GCGTTGGCGT TGTTCCCGTC GGACGGCTAC GACGAGGTGA TGCGGCAGGT GAAGGTCGCC GACGACTGGC TGTCGGACAA GGCCGGCCCG GTGAAGGTGC CCGCGACCAC GGCGATCACG AAGGCGCGGG ACCGGCTCGG GGTGGAGCCG GTGAAGCTGC TGTTCGAGCG CACGGCGGTG CCGATGGCCC TGCCCCGGCG GACGGTGGGG GCGTTCTACC GGGGCTGGCG GGTGTGCACG GTCGACGGGA CGACGCTGCT GGTCCCGGAC ACCGACGAGA ACGCCGCCGC GTTCGGTAAG CCCGGCAACG ACCAGGGCGA GGGCGCGTTG CCGCAGGTCC GGGTGCTCGG GCTGGTCGAG TGCGGCACCC GGGCGCTGCT GGGCGCCGGG TTCGGCGGGA CCGGGGGGTC CAAGGCTGCC AGCGAGCAGG CCCTGTTCCC CGACCTGCTC GGCGCGTTGC GCCCGGGCAT GCTGGTGCTG GCCGACCGGA ACTTCCTCGG CTTCGAGTTG TTCGCCAAGG CCGCCGCGAC CGGCGCGGAC CTGCTGTGGC GGGCCAAGTC CGACCGGCGC CTGCCCATCG ACACCGAGCT CGCCGACGGC TCTTACCTGT CCCACCTCGT CGAACCCGGC ACCCGAGACA AAGGCCGGAA AATCACGGTA CGGGTCGTCG AGTACACCCT CGACCGCGAC CCCGACAGCC CCCTGCCCGC GGGCAAGAAG GAGACCTACC GGCTGGTCAC CACAATCCTC GATCCCGACG CCGCGCCCGC CACCGACCTG GCCGCGCTCT ACAGCGACCG GTGGGAAGTC GAGACCCTCC TCGACGAAAT CAAGGTCCAC CAGCAGGACG GCCGGCTGGT GCTGCGGTCC CGGGCCCCCG ACCGGGTCGA GCAGGAGGTC TGGGGCGTCC TGCTGCTGCA CCGGGCGTTA CGGAAACTGA TCCACGACAC CGCGCTGGTC GAAGGGATCG ACCCCGACCG GCTCTCCTTC ACCCACACCG TGACGATCGT CCGCCGCCAG GTTGTGCGCC GGGCGATTTT CCCCCCTCCG CCGGACGGCC CGGATCCTGG CCGCGGTGAC CACTGA
|
Protein sequence | MPRRGQVKEK PEDRLVDRVG LGVLAAQFPD ALVDRVVAET GRRERRTRDL PAALTLRYVV ALALFPSDGY DEVMRQVKVA DDWLSDKAGP VKVPATTAIT KARDRLGVEP VKLLFERTAV PMALPRRTVG AFYRGWRVCT VDGTTLLVPD TDENAAAFGK PGNDQGEGAL PQVRVLGLVE CGTRALLGAG FGGTGGSKAA SEQALFPDLL GALRPGMLVL ADRNFLGFEL FAKAAATGAD LLWRAKSDRR LPIDTELADG SYLSHLVEPG TRDKGRKITV RVVEYTLDRD PDSPLPAGKK ETYRLVTTIL DPDAAPATDL AALYSDRWEV ETLLDEIKVH QQDGRLVLRS RAPDRVEQEV WGVLLLHRAL RKLIHDTALV EGIDPDRLSF THTVTIVRRQ VVRRAIFPPP PDGPDPGRGD H
|
| |