Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6280 |
Symbol | |
ID | 5674599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7627254 |
End bp | 7628261 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245132 |
Product | integrase family protein |
Protein accession | YP_001510528 |
Protein GI | 158318020 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.737923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC CTGCCGGAAC CCTCGCCCCG GTTCCGTCCG TTGACCTCGA CATCGTCGGT GCCCGGGACG CGAGCGTCTC CGCGACCACC GCCCGCATGA TCGACGACGC GACCGCAGCG AACACCAAGC GCGCGTACGC CAGACAGTGG TCGACCTTCA CGGCCTGGTG TTCCCAGGAG GGCCGGACCA TGCTCCCGTG CTCGGATGCC ACGCTCGCCG AGTACGCGGC GCAGCTCGTC ACCGCCGGGG CCGGGCCGGC CACCGTGGAA CAGGCCATTG CCACCATCCG CCGGGTCCAC CGCGACCAGG CCGCCACCCC GCCCGACACC CGCGCCGCAC GGCTAGTGCT GCGCACAGCC CGCCGGGAAC GCGCCGACGC CGGGCAGGCC ACCCGACAGG CACCGCCAGC CGCGCTGGAT CAGCTCCGCG CCATGCTCGC CGCCTGCGAC ACCAGCACCC GCGGAGTCCG TGACCGGGCC CTGCTGCTGC TCGGCTTCGC GATGATGGGC CGCCGTTCGG AGCTCGCCGC GCTCGACCTG GCCGACATCC GCGAGGTCGA CGAGGGCCTG ATCGTCGTCG TCCGCCGGTC GAAGACCGAC CAAGACGGCC GCGGCGCCGA GGTCGCCGTT CCCTACGGCA GCCGGCCGGA CTCGTGCCCT GTGCGCGCGG TCCGGGCCTG GCGCGGCTGG CTCGCCTCCA TCGGCATCAC CGAGGGCCGA CTGTTCCGTT CCGTAACGAG GCACGGCCAC ATCGGCGAGG CCATGTCCGG CGACGGGATC CGCCGAGCCG TCCGCGCCGC CGCCGTCCGG GCCGGCCTCC CGAACGCGGA CGTCTTCTCG GCTCACTCGC TGCGGGCTGG AGGGGCGACA GCCGCGGCGA AGGCCGGCGC CCCGGTCGCC GCGATCGCCC GGCAAGGCCG CTGGTCACCC ACCTCGCCCG TGGTGCACTC GTACATCCGG GCAGCCGACC GATGGCGGGA CAACCCGATG GCCAGTGTGG GCCTGTGA
|
Protein sequence | MIDPAGTLAP VPSVDLDIVG ARDASVSATT ARMIDDATAA NTKRAYARQW STFTAWCSQE GRTMLPCSDA TLAEYAAQLV TAGAGPATVE QAIATIRRVH RDQAATPPDT RAARLVLRTA RRERADAGQA TRQAPPAALD QLRAMLAACD TSTRGVRDRA LLLLGFAMMG RRSELAALDL ADIREVDEGL IVVVRRSKTD QDGRGAEVAV PYGSRPDSCP VRAVRAWRGW LASIGITEGR LFRSVTRHGH IGEAMSGDGI RRAVRAAAVR AGLPNADVFS AHSLRAGGAT AAAKAGAPVA AIARQGRWSP TSPVVHSYIR AADRWRDNPM ASVGL
|
| |