Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3003 |
Symbol | |
ID | 5671386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3533240 |
End bp | 3534331 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641241906 |
Product | integrase catalytic region |
Protein accession | YP_001507326 |
Protein GI | 158314818 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3415] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTAA CTGATAATCA ACGCAATATT CTCGAGTCCT TGGCTGGAGG TGGTGGGTAC GACAGGTCTG CGGCTGCCCG CGCCCGTATG GTGCTATGGC GGGACGAAGG ATTCTCGGTG CGGGAAATAG CCGAGAAGGC GGGCGCGTCG AAGCCTACCG TGCGGCTGTG GCTGTCGCGC TATGACGAGG AGGGGCCGGA CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACTCC TCCACCGGAG ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACAATCT CCAGCCGCAC CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGATGTC GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT CTGGCCACCC ACTACGGCCC TGACGTCGAC ACATGGCTAC GCAGGCACAA GAACGTCGCG TTCCATTTCA CCCCGTCCGG CGGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT CTCACCCGGC ACGCACTCCA GCGCGGGGCG TTCGTCTCGG TCCAGGACCT CGTCAACACC ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC GCAGAAGAGA TCGTAGCCAA GGTGGAGGTA CTCCACCGGG AATTCAGGAA GCTGCTCGCC AACAACTTGT GA
|
Protein sequence | MILTDNQRNI LESLAGGGGY DRSAAARARM VLWRDEGFSV REIAEKAGAS KPTVRLWLSR YDEEGPDGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG VSVSHTFVAQ LWRENNLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVA FHFTPSGGSW LNQVENWFGI LTRHALQRGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT AEEIVAKVEV LHREFRKLLA NNL
|
| |