Gene Information |
Locus tag | Franean1_2616 |
Symbol | |
ID | 5671010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3096138 |
End bp | 3097856 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241532 |
Product | integrase family protein |
Protein accession | YP_001506952 |
Protein GI | 158314444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
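The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3096138
end_bp = 3097856

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.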
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTACGT GCTCGGGTTG GCCCGTCTGC CCGAGCGGAA GGCCGACCTT CTCGCCGTCG TCCGCACACT GTGGATCTAC CGGGAGTGTG TTCCGGTCGA GTGCCGACTC GGTGGCTACC CGTGGGCGGG CACGGCCGCC AGGGATCTGG TGCGCATGCC ACCGGCGGGC CGGGAGAACA AGACCCCCGC GGATCGCGGC GCCGACGATG GAAGCGCTGC TGGGCTGGGC GCTGACCATG GTCGAGACGA TCGGGCCGGA CATCCGCGAC GCCTGGAAGG AGCTGGCCCG GCTCGAGGCC GGCACCCACC CGTCCCAGCG GATCTACGAC GGGCTGAGTG TGCCCGCCCG GCTCGACCTG TTCGTCCGGC GGGCCGCCGA GAACGGAACC CCGCTCCCCG GACGCAGCAC CGCAGGCAAG GCCGTCGCCG TCAACGGTGC CCACGTTCTC CGCCTCGTCG GCGTCCCACC GGACAAACGA TTCGGCCTGC CGCCACAGCA GCGCGCTCTG CTGGAGGAGT CCGGACTCCC GGTGGCGACC AACACCTACA TCGGTACGAT CACAGGACAC ATCGACCGCG TTCCCTGGCG GGCCGAGCCC ATCAGCGTCC CGGAGCTACC CACCATGATC AGGATGCTTT ACGCGTCTGC GTTCGTGGTC ATCTGCTACC TGTCCGGCAT GCGGCCCGGC GAGGTTCTCA ACCTGCCCCA CGGCTGCCGC GACAGCGACC CCCGCACCGG CGAACTGCTG CTACGCGGCC GGCGCGGCAA AGGCTACGAC CGCGGCCCAC TGACCCGACA CGCAGAACCC GGCCGACCGT GGGTCGTCGT CACACCGGTC CACAGCGCCG TCGAGATGCT GGAGAACCTG GCTGATTTCC CGTTCCTGTT CCCGGCAAGC CCGATCATGG CCCACGCCCA TCGGGCGAAC ACGACCCACG CCCGCTCCAC CGGGGCGATC AACCGGGATC TGGAAGACCT CACCACCTGG ATCAACACCA CGTTCGCCCG CCCCGACGGC GACGCGCCGA TCCCACCGGA CCCGACCAAG CACCTGCACG CAGCCCGGTT CCGGCGGACG CTGGCCTATT TCATCGTGCG CCGGCCCCGT GGCCTCGTCG CGGCGGCGCT CCAGTACGGA CATCTCCATA CCAAGGTCAC CCTCAACTAC GCCGGTGACG CGGACACGTC CTGGCTCGAC GATCTCCCCG TCGAACGCCT GGAGATGATC CTCGAACAGG TCGACACAGA TGCCCGGCTC CTCGAGAACG ACGAGCACGT CAGCGGCCCC GCCGCCGCCG ACTACCACGC CCGCATCACC CGGGCAGCAC GGTTCTCCGG CCGCGTCGTC AACCAGACCC GCAACGCCCA GCGGCTCCTC GCCAGCCTCG ACCCGGACAT CCACCACGGC GACGGCATCA CCTGCGTCTA CCGCGCCGAA ACAGCCGAAT GCCGGCGCAT CCTCGCCAGC CAGGGACTCG CCGCCGACAG CCCGCGAGAA TCCGAGTGCC GATCCTCCTG CACCAACCTC GCTTTCACCG ACCGGGCGGT TGACCAGCTC CATGCCCGGC TCACCCACCT CGACGCCACC GCCGATCACT CCCTGACGCC GCAACCGCTC CGCGACCGCG CCCAAGCCCA GGCGAACGCC ACGAGAGCCG TCATCGACCG ACACGTCGCG TCGTCCAGCC ACCTGACAGA ACCGGCAGGA CAACGATGA
|
Protein sequence | MPTCSGWPVC PSGRPTFSPS SAHCGSTGSV FRSSADSVAT RGRARPPGIW CACHRRAGRT RPPRIAAPTM EALLGWALTM VETIGPDIRD AWKELARLEA GTHPSQRIYD GLSVPARLDL FVRRAAENGT PLPGRSTAGK AVAVNGAHVL RLVGVPPDKR FGLPPQQRAL LEESGLPVAT NTYIGTITGH IDRVPWRAEP ISVPELPTMI RMLYASAFVV ICYLSGMRPG EVLNLPHGCR DSDPRTGELL LRGRRGKGYD RGPLTRHAEP GRPWVVVTPV HSAVEMLENL ADFPFLFPAS PIMAHAHRAN TTHARSTGAI NRDLEDLTTW INTTFARPDG DAPIPPDPTK HLHAARFRRT LAYFIVRRPR GLVAAALQYG HLHTKVTLNY AGDADTSWLD DLPVERLEMI LEQVDTDARL LENDEHVSGP AAADYHARIT RAARFSGRVV NQTRNAQRLL ASLDPDIHHG DGITCVYRAE TAECRRILAS QGLAADSPRE SECRSSCTNL AFTDRAVDQL HARLTHLDAT ADHSLTPQPL RDRAQAQANA TRAVIDRHVA SSSHLTEPAG QR
|
| |
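The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) predicts when GTG is used as an initiation codon. The sketch below illustrates this on the first three and the final 10-base blocks of the gene sequence. The helper name translate_cds and the way the codon table is built are illustrative assumptions, not part of the database record, and the sequence is assumed to be given in the coding (5' to 3') orientation as displayed.

```python
from itertools import product

# Codon assignments in NCBI "TCAG" order. Table 11 uses the same amino acid
# assignments as the standard code and differs only in its initiation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
print(translate_cds("GTGCCTACGT GCTCGGGTTG GCCCGTCTGC"))
# Final 9 bases of the gene sequence, which are in frame and end in TGA.
print(translate_cds("CAACGATGA"))
```

The first call yields the same ten residues that open the protein sequence, and the second yields its last two residues followed by the stop.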