Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2061 |
Symbol | |
ID | 3904634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2424707 |
End bp | 2425804 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879397 |
Product | IS630 family transposase |
Protein accession | YP_481163 |
Protein GI | 86740763 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3415] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.289614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0326995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGACG CGGCCCGCGG CTACTCCAAC GCGCGGATAG CCCGGCGACT GTGCGTGACC GAGGACACGG TCCGCACGTG GCGGGGCCGG TTCGCGCGGC GACGCGAGGC GGGACTGGTC GACCTGCCCC GGTCGGGTCG GCCACGGCGG ATCAGCGAGG CGGAACGGGC CGAGGTCGTC GCGCTGGCCT GCCAGTTACC TGCCGAGACC CAGGTGCCGC TGGCCCGCTG GTCGTGTCCG GAACTCGCCG CCGAACTGCT CTCCCGCGGC CTGGTCGACG CGATCTCGGC GTCGTCGGTG CGGCGGATCC TCGCCGAGCA CCCGATCAAA CCGTGGCGCT ACCAGTCCTG GATCTTCGCG CGCGGCCCGG GCTTCGCCGC GAAGGCGAAA GTGATCCTCG ATCTCTACGA GGGCTTCTAC CAGGAGGAAC CCCTCGGCCC CGAGGATCGA ATCGTCTCGA TCGACGCGAA GCCGTCGATC CAGGCGAGAG CCCGGATCCA CCCGACCACC CCGCCCGCCC CGGGCAGGAT CATCCGGGTC GAGCACGAGT ACGAACGTCA CGGCGCGCTC GCGCTGTTGC CCGCGCTCGA CGTCCAGACC GGGAGGATCG CCGCCGTGCT GACCCCGCCG ACGACGGGCA TCGCGCCGTT CATGGAACTG ATGGGCCAGG TCATGGCCCA GGACCGCTAC CGGACGGCGA AGCGCGTGTT CGTGATCGTC GACAACGGCT CCGACCACCG AGGCCAGGCC TCGATCAACC GACTCAGGGC CGCCCACCCG AACCGCATCC TGATCCACAC CCCGACACAC GCCTCATGGC TGAACCAAGT GGAGATCTTC TTCTCGCTCG TCCAACGGAA GGTCGTGTCA CCCTGCGACT TCGCCAGCCT CGACGTGCTC GCCGACACCC TGACAGCGTT CGTCGACCGC TACAACGTCA CCGCCACACC GTTCAAGTGG AAGTACACCG CGGCCGACCT CGAACGCCAC CTCGCCCGCC TCGACGACGA CACAGCACCA GCCGTCGCGG GCTCCGTCGC TCGGTTGCCC GTCCCACCAC CCGACACCAA TGAGCCTTTT CAATCTCTGT TTGTCTGA
|
Protein sequence | MLDAARGYSN ARIARRLCVT EDTVRTWRGR FARRREAGLV DLPRSGRPRR ISEAERAEVV ALACQLPAET QVPLARWSCP ELAAELLSRG LVDAISASSV RRILAEHPIK PWRYQSWIFA RGPGFAAKAK VILDLYEGFY QEEPLGPEDR IVSIDAKPSI QARARIHPTT PPAPGRIIRV EHEYERHGAL ALLPALDVQT GRIAAVLTPP TTGIAPFMEL MGQVMAQDRY RTAKRVFVIV DNGSDHRGQA SINRLRAAHP NRILIHTPTH ASWLNQVEIF FSLVQRKVVS PCDFASLDVL ADTLTAFVDR YNVTATPFKW KYTAADLERH LARLDDDTAP AVAGSVARLP VPPPDTNEPF QSLFV
|
| |