Gene Information |
Locus tag | Franean1_3535 |
Symbol | |
ID | 5671905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4194350 |
End bp | 4196026 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242422 |
Product | transposase IS4 family protein |
Protein accession | YP_001507842 |
Protein GI | 158315334 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
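The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4194350, 4196026)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Both values agree with the Gene Length and Protein Length fields listed for Franean1_3535.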
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGC AGCCACGTCC GTGGCCGCAG GTTCCTGAAC AGACCGCGGC GGTGGCCTGT GCGGCGTTCC CGAAAGGCAC ACTGGCAATC CGTGTTCGCG ATGAGCTGCC CGAGTTGTTT GCTGATGAGC AGTTCCTCGC AGCGTTCGGC GTGCGCGGTA GACCAGGCAT CTCACCGGGG CAGTTGGCGC TGGTCACGGT GTTGCAGTTC GCGGAGAACC TCACCGACCG GCAGGCGGCC GACGCGGTAC GGGCCCGGAT CGACTGGAAA TACGCCCTCG GTCTGGAGCT GACCGACGCA GGGTTTGATC ACACTGTGTT GACCGGGTTC CGGCAGCGGC TTATCGACCA TGGTCTGGAG GAGAAGGTAC TGGACCTGCT GCTGGCCCGG CTGTCCGAGT TAGGACTCGT CAAGGCTGGC GGCCGGCAAC GCACCGACTC TACGCATGTG CTGGCGGCGG TACGCTCGGC CAATCGGCTG GAGTTCCTCG CCGAGACACT GCGAGCGGCC TTGGAGGCGT TGGCCGTGGC CGCACCGGAC TGGCTGAGGG CCCAGATCAA CACCGAATGG GTGACACGGT ACGGCGCCCG TATGGATTCC TACCGGATTC CGAAGGGCGA CGACAAGCGT AAAGCGATGG CCATTCAGGT TGGAGTCGAC GGGTTCGGTC TTCTGGAAGC CGTACACACC GTCGGCGCAC CGATCTGGCT ACGTGAGATC CCCGCCGTGG TCACCCTGCG TGCGGTGTGG CTCAGGCAGT ACCACCGCAC GATCACCCAT GACGGGCAGG AGGTGGCGTG GCGGGAGGAA AAAGACCTCC CGCCCAGCAG AGACCGGATC TGCTCGCCGT ACGACACCGA CGCCCGGTAC GCGACCAAAC GCGGTTCCGG CTGGGAGGGC TACAAAGTCC ATCTCACGGA GACCTGCGAC GACGTGAGCA CGACCGGCGC GCCACACCTG GTCACGAATG TGACCACCAC CGACGCGACC GTCACCGACG TGGAGATGCT CGAACGGATC CACAAGGATC TCGACCGCAG ATCGTTACTT CCGGCGGAGC ACCTGGTCGA CGCCGGCTAC ACCAGCGCCG AGCTCCTCAT CGACTCCCAG CGCGATTTCA GTATCACGTT GCTCGGTCCG CTGCCGGCCG ACAACTCCCA CCAGGTTCAG GCCCGTGGTG GCTTCGAACG CGCCGCGTTC GCCATCGACT GGGACAACCA GCGGGTCACC TGCCCGCAAG GCGTGACCAG CACGATCTGG TCGTCCTGCA ACGAACGCGG CCGGGAATCG ATCGTGGTTC GTTTCCCCGT CACAGCCTGC CAGCCATGTC CCGTCCGTTC ACAATGCACA CGAGCCACCC GGAACGGCCG CCAGTTGATG CTACGTCCCC GCGACATCCA CGAAGCGGTC GAGCAGGCCC GCGCCAAACA GAACACCGAC GAGTGGAAAC AGCGCTACGC AACCCGCGCC GGCGTCGAGA GCACCATCCA TCAATCAGTT GCCGTCACCG GGATCCGCCG CTGCCGCTAC ACCGGACTAC CCAAGACCCG ACTTGCCCAC GTCCTCGCCG CCACCGCCCT CAACCTGATC CGGTTGGACG CGTGGTGGAC CGGCACGCCA CTCGACCGGC CTCGGGCCAG CCACCTCGCA AGACTCGACT TCAGCCTCGC CGCATAG
Protein sequence | MSMQPRPWPQ VPEQTAAVAC AAFPKGTLAI RVRDELPELF ADEQFLAAFG VRGRPGISPG QLALVTVLQF AENLTDRQAA DAVRARIDWK YALGLELTDA GFDHTVLTGF RQRLIDHGLE EKVLDLLLAR LSELGLVKAG GRQRTDSTHV LAAVRSANRL EFLAETLRAA LEALAVAAPD WLRAQINTEW VTRYGARMDS YRIPKGDDKR KAMAIQVGVD GFGLLEAVHT VGAPIWLREI PAVVTLRAVW LRQYHRTITH DGQEVAWREE KDLPPSRDRI CSPYDTDARY ATKRGSGWEG YKVHLTETCD DVSTTGAPHL VTNVTTTDAT VTDVEMLERI HKDLDRRSLL PAEHLVDAGY TSAELLIDSQ RDFSITLLGP LPADNSHQVQ ARGGFERAAF AIDWDNQRVT CPQGVTSTIW SSCNERGRES IVVRFPVTAC QPCPVRSQCT RATRNGRQLM LRPRDIHEAV EQARAKQNTD EWKQRYATRA GVESTIHQSV AVTGIRRCRY TGLPKTRLAH VLAATALNLI RLDAWWTGTP LDRPRASHLA RLDFSLAA
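The gene sequence as listed begins with ATG and ends with TAG, which suggests it is given in coding orientation even though the gene lies on the minus strand. Under that assumption, the protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: the helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for non-start codons. Only the first three 10-base blocks of the gene sequence are used here; the full sequence can be pasted in the same way. A codon containing a character other than A, C, G, or T would raise a `KeyError`.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_blocks = "ATGTCGATGC AGCCACGTCC GTGGCCGCAG"
print(translate(first_blocks))  # MSMQPRPWPQ
```

The output matches the first 10-residue block of the listed protein sequence. Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.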