Gene Information |
Locus tag | Franean1_2179 |
Symbol | |
ID | 5670579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2613266 |
End bp | 2614357 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641241100 |
Product | integrase catalytic region |
Protein accession | YP_001506521 |
Protein GI | 158314013 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3415] Transposase and inactivated derivatives |
TIGRFAM ID | |
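The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2613266
end_bp = 2614357
reported_gene_length = 1092      # bp
reported_protein_length = 363    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == reported_gene_length)
print("protein length matches record:", computed_protein_length == reported_protein_length)
```

Under these assumptions both checks agree with the record: 2614357 - 2613266 + 1 = 1092 bp, and 1092 / 3 - 1 = 363 aa.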
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.376991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTAA CTGATAATCA ACGCAATATT CTCGAGTCCT TGGCTGGAGG CGGTGGGTAC GACAGGTCTG CGGCTGCCCG CGCCCGTATG GTGCTATGGC GGGACGAAGG ATTCTCAGTG CGGGAAATAG CCGAGAAGGC GGGCGCGTCG AAGCCTACCG TGCGACTGTG GCTGTCGCGC TATGACGAGG AGGGGCCGGA CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACCCC TCCACCGGAG ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACAATCT CCAGCCGCAC CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG TTCCATTTCA CCCCGTCCGG CAGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT CTCACCCGGC ACGCACTCCA GCGCGGGGCG TTCGTCTCGG TTCAGGACCT CGTCAACACC ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC ACAGAAGAGA TCGTAGCCAA GGTGGAGGTA ATCCACCGGG AATTCAGGAA GCTGCTCGCC AACAACTTGT GA
|
Protein sequence | MILTDNQRNI LESLAGGGGY DRSAAARARM VLWRDEGFSV REIAEKAGAS KPTVRLWLSR YDEEGPDGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG VSVSHTFVAQ LWRENNLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVT FHFTPSGSSW LNQVENWFGI LTRHALQRGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT TEEIVAKVEV IHREFRKLLA NNL
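The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases, with the spaces between 10-base blocks ignored. The helper name `gc_percent` is hypothetical, and only the first 30 bases of the gene sequence are used here as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three 10-base blocks of the gene sequence above, as sample input only.
prefix = "ATGATCCTAA CTGATAATCA ACGCAATATT"
print(f"GC content of the first 30 bases: {gc_percent(prefix):.1f}%")
```

Passing the full gene sequence string to the same function gives a value to compare against the reported GC content field.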