Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4376 |
Symbol | |
ID | 5672729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5221516 |
End bp | 5222703 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243245 |
Product | integrase catalytic region |
Protein accession | YP_001508662 |
Protein GI | 158316154 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGGCA CGAATGGGGT GTGGTCGCTG CTCTACGCCC TGACACGCAA CGCTCTCGGA CTGATGCTGC TCCGGGTCCG TGGGGACACC GCGAAGGACG TGGAGCTCCT TGTCCTGCGA CATCAGGTGG CGGTACTGCG ACGGCAGGTG AACCGTCCGA CGTTGGAACC GGCGGATCGG CTGATCCTCG CGGCGCTGTC CCGGCTGCTG CCCCGGGCCC GCTGGGGTTC GTTCTTCGTC ACCCCCGCCA CCGTGCCGCG CTGGCACCGG GAACTCCTCG CACGCCAATG GACCTACCCG CGGAAGTCGC CTGGGCGGCC ACCGGTCCGC CGGGAGATCC GCGAGCTGAT CCTGCGCCTC GCACGGGAGA ACCCGACCTG GGGCCACCGC CGGATCCACG GCGAGCTCGT CGGGCTGGGT TACACGGTCG GGGTCGCCAC TGTCTGGCGG ATCCTGCACC GCGCCGGTGT CGACCCCGCA CCCCGCCGGG CCGACACCTC CTGGCGCACG TTCCTGTCCG CCCAGGCCTC CGGCCTGCTG GCCTGCGACT TCTTCACCGT GGACACCGTG TTCCTCCAAC GGATCCACGT GCTCTTCGTC GTCGAACACA CCACCCGCCA CGTCCACGTC CTCGGGGCCA CGAAACACCC GACCACGGCG TGGGTCACCC AGCAGGCACG GAACCTGCTG ATGGACCTCG ACGAGCGTGG CCACCGGTTC CGGCTCCTCA TCCGTGACCG CGACACGAAA TTCACGGCCT CGTTCGACGC TGCCTTCGCC GGGGCCGGCA TCGACGTGAT GCGCACACCG CCACAGTCAC CGAAAGCGAA CACGATCGCG GAACGCTGGG TCGGCACCGT CCGCCGCGAA TGCACCGACC GACTACTGAT CGTCTCCGAA CAGCACCTCA CGTCGGTCCT CAGCAGCTAC GCCAAGCATT TCAACACCCA CCGACCCCGC CGCTCCCTCC ACCAGCACCC ACCCGACCCG CCACCGATGG TCACACCGAC CCCGGAGTCC GCCGTCCGTC GCACACGCAT CCTCGGCGAC ATGATCAACG AGTACCGCAA CGCCGCCTGG CGACGCCCCC AAACGATCAC GTCAGCTGCA AAAGAGCAGA TCAGAGGCCG AAACCCAAGT TCTGGAGCCC CACACCCTCG TTGCTGTTCT CGGGCACGAT TGCTGTAG
|
Protein sequence | MSGTNGVWSL LYALTRNALG LMLLRVRGDT AKDVELLVLR HQVAVLRRQV NRPTLEPADR LILAALSRLL PRARWGSFFV TPATVPRWHR ELLARQWTYP RKSPGRPPVR REIRELILRL ARENPTWGHR RIHGELVGLG YTVGVATVWR ILHRAGVDPA PRRADTSWRT FLSAQASGLL ACDFFTVDTV FLQRIHVLFV VEHTTRHVHV LGATKHPTTA WVTQQARNLL MDLDERGHRF RLLIRDRDTK FTASFDAAFA GAGIDVMRTP PQSPKANTIA ERWVGTVRRE CTDRLLIVSE QHLTSVLSSY AKHFNTHRPR RSLHQHPPDP PPMVTPTPES AVRRTRILGD MINEYRNAAW RRPQTITSAA KEQIRGRNPS SGAPHPRCCS RARLL
|
| |