Gene Information |
Locus tag | Franean1_3272 |
Symbol | |
ID | 5671646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3876749 |
End bp | 3877921 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242164 |
Product | transposase IS4 family protein |
Protein accession | YP_001507584 |
Protein GI | 158315076 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5659] FOG: Transposase |
TIGRFAM ID | |
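The coordinate and length fields in this table can be checked against one another. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3876749, 3877921)
print(length_bp)                  # 1173
print(protein_length(length_bp))  # 390
```

Both results agree with the Gene Length (1173 bp) and Protein Length (390 aa) rows.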
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAACAGT CCCGATCTGT CGTGGTGGTG GAGGCGTGGG CGGCTGAGCT GGAGGGGTTG TGTTCGCTGG TCGGGGGCCG GTTCGGGAGG GCGGAGCCGC GGCGGCGGGT GGCCGAGTAT GTGTCGGGCC TGGTCGCGGG TTTGGATCGC AAGAACGGCT GGACGTTGGC GGAGCGCGCC GGGGAGGTGA GCCCGGACGG GATGCAGCGT CTGCTGCGCC GCGCTGACTG GGACGTCGAC GGCGTCCGCG ACGACATCCG CGATCATGTG GTGGGCCGGC TCGGTGACCC GGACGCCGTG CTGATCGTCG ATGACACCGG GTTCCTGAAG AAGGGCACCC GGTCGGCCGG GGTGCAAAGG CAGTACTCCG GGACGGCGGG GCGTACGGAG AACTGCCAGG TCGGGACGTT TCTGGCCTAT CGGTCCCGGT TCGGGCAGGC GCTGATCGAT CGGGAGTTGT ATCTGCCCGA GGGGTGGATC GCTGATCGGG AGCGCTGCCG CCGGGCGGGA ATCGACGACG AGGTGGCGTT CGCAACGAAG CCTCGCCAGG CCCTCGCCAT GATCGAGCGG ACGGTCGCGT CAGGGGTGCC GTTCGGCTGG GTGACTGCCG ACGAGGCCTA CGGACAGGTG AAATACCTGC GAGTCTGGCT CGAACAACAC GACGTGGCGC ACGTGCTGGC GACCCGGCGC AACGACGACC TGATCACGAC CACGATGGGC CAGGCCAGAG CCGACGAGCT GATCGCCGGA CTCTCGCCGC GGGCCTGGTG CCGGATCTCG GCCGGCACCG GTTCCCACGG GCTGCGGGAC TACGACTGGG CGCGGGTACC GATCCGCATC CGGACCTGGT GGACACCAGG CCGCGGCCAC TGGCTGCTCG CCCGCCGCAG CCGGACGTCC GGCGAACTGG CCTACTACAT CTGCTACGGC CCCCGCCGCA CCTCGCTGGC CCAGCTCGCG ACCGTCGCCG GTGCTCGCTG GGCCATCGAA GAGGCCTTCC AACAGGCCAA GCAGACCTGC GGGCTGGACG ACTACCAGGT CCGCGACTAT CGAGCCTGGT ACGCCCACAT CACCTTGTCG ATGCTCGCCT ACGCAGCCCT TGCCACGGTC CGCGCCGAAC AGGTCAAAGC CAGCCAGGTA AAAGGGGCCG AAGCCCAGCC CACCAGGGCA TGA
|
Protein sequence | MKQSRSVVVV EAWAAELEGL CSLVGGRFGR AEPRRRVAEY VSGLVAGLDR KNGWTLAERA GEVSPDGMQR LLRRADWDVD GVRDDIRDHV VGRLGDPDAV LIVDDTGFLK KGTRSAGVQR QYSGTAGRTE NCQVGTFLAY RSRFGQALID RELYLPEGWI ADRERCRRAG IDDEVAFATK PRQALAMIER TVASGVPFGW VTADEAYGQV KYLRVWLEQH DVAHVLATRR NDDLITTTMG QARADELIAG LSPRAWCRIS AGTGSHGLRD YDWARVPIRI RTWWTPGRGH WLLARRSRTS GELAYYICYG PRRTSLAQLA TVAGARWAIE EAFQQAKQTC GLDDYQVRDY RAWYAHITLS MLAYAALATV RAEQVKASQV KGAEAQPTRA
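The gene sequence begins with TTG while the protein begins with M, which is expected under translation table 11 (bacterial, archaeal and plant plastid code), where TTG can act as an initiation codon. The following is a minimal sketch of that translation step in plain Python, not the pipeline used to produce this record. The helper names are hypothetical, the start-codon set follows the NCBI listing for table 11 as best recalled, and the 10-base spacing of the record is stripped before processing.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons for table 11 (assumption: NCBI genetic code listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is always read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
first_30 = "TTGAAACAGT CCCGATCTGT CGTGGTGGTG"
print(translate(first_30))  # MKQSRSVVVV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked the same way.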