Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4157 |
Symbol | |
ID | 5672512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4937187 |
End bp | 4938287 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243030 |
Product | integrase domain-containing protein |
Protein accession | YP_001508447 |
Protein GI | 158315939 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.118927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.593362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGCG GGCAGCATGA CGGCGGTGAG ACCGTCCGGC GACAGGTCGG TCCCGGGCCG GCGCCAGACG TCGTCGACGC GACCGACATG GTGGACGCGG TGGAGATCGT CGCGGCCGGG CCGGTGGCCG TGCCGGGCAC GGTGGGGCTC GACGGGCCGC TCGCCGGCCG GGTCGGGGAC TACGCGCGGG CGTCCCGGTC GGCGGCGACC TGGCGTGCCT ACGACGCCGA CCTGCGTCAT TTCCGGTCCT GGTGCGAGGG ACGTCCCGTA CCGCTGGTCG CCGTGCCCGC CTCCGCGGTG ACGGTCGCCG GGTACATCAC CGAACTGGCG GACGCCGGGT ATGCGCCGTC GACGATCCGC CGTCGGCTGG CTGCGATCTC GGTGGCGCAT CAGCTCGCCC ATGCCGAGAA TCCGACGGGG TCGGCGGAGG TGTCAGCGGT GTGGAACGGG ATCCGCCGGT CGCGGGGGGT GCGCCCGGCG CGCAAGGCCG CGTTGGACAC GACGTTGCTG TCGCGGGTCG TCGCCGGCCT CGACGACTCG CAGTTGGCGG ATGTGCGGGA CAGGGCGCTG CTGCTGGTCG GGTTCGCCGG CTGTCTGCGT CGCAGTGAGC TGGTCGGGTT GGACACCGCC GACCTGGTGG AGACCGACGA CGGGCTGGTC GTGACGGTGC GCCGTTCCAA GACCGACCAG GAGTCCGCCG GTGCGCAGGT CGGGTTGGCG TACGGGTCGT ACCGGCCGAC GTGCCCGGTG CGGGCGTGGC GGGGATGGGT GGCGGCCGCG GCGGCGGCGG GGACGCCGCT GGCTGGCGGG GCGGCGTTTC GGGGGGTGAA CCGGCACGGG CAGGTCGGCG CGGGCCGGCT CTACCCGGGG TCGGTGGCGC GGATCGTGCA GCGTCGGGTG GCCGCGGCCG GGTTGGATCC GGCGGATTTC GCGGGGCATT CGCTGCGGTC GGGGTTCGCG ACGGCGGCGG CGCGGGCCGG GGTGACGGAC CGGTCGATCA TGCGGCAGGG CCGGTGGCGG TCGGCGGCGT CGTTGGAGTC GTATGTGCGG GCCGGGCGGC TGTTCGACGC GGACAACCCG TCGGGTCGGG TCGGTCTGTG A
|
Protein sequence | MAGGQHDGGE TVRRQVGPGP APDVVDATDM VDAVEIVAAG PVAVPGTVGL DGPLAGRVGD YARASRSAAT WRAYDADLRH FRSWCEGRPV PLVAVPASAV TVAGYITELA DAGYAPSTIR RRLAAISVAH QLAHAENPTG SAEVSAVWNG IRRSRGVRPA RKAALDTTLL SRVVAGLDDS QLADVRDRAL LLVGFAGCLR RSELVGLDTA DLVETDDGLV VTVRRSKTDQ ESAGAQVGLA YGSYRPTCPV RAWRGWVAAA AAAGTPLAGG AAFRGVNRHG QVGAGRLYPG SVARIVQRRV AAAGLDPADF AGHSLRSGFA TAAARAGVTD RSIMRQGRWR SAASLESYVR AGRLFDADNP SGRVGL
|
| |