Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2537 |
Symbol | |
ID | 5675697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3016135 |
End bp | 3017805 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641241453 |
Product | recombinase |
Protein accession | YP_001506873 |
Protein GI | 158314365 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGGT TCGCGTTCGA GGGTCGCTGC TCCACCGAGG ATCAGCAGGA TCCCGAGTCC TCACGGGCAT GGCAGATAAC CCGCGCCAAG GCGCTGATCG AGCCGCACGG CGGCGAGATT GTCACGGAGT ACTTCGACGC CGGAAAGTCC CGGTCCATTC CTTGGCAGCG GCGCCCCATG GCTAACGCCC TCCTCCAGGC CCTGAAAGAT CCCCAGCGCG GTTTCGAAGC GGTGGTCATC GGCGAGCCGC AGCGCGCCTT CTACGGCAAC CAGTTCGGCC TGGTATTTCC CCTGTTCAGC CACTACCAGG TGCCTCTGTG GGTGCCCGAG GTCGGTGGGC CGATCGATCC CGACAACGAG GCCCACGACC TCATCATGTC GGTGTTCGGC GGGATGTCCA AAGGTGAGCG TAACCGGGTA AAGATCCGGG TGCGTGCGGC GATGGCAGCC CAGGCCAAGG TAGAAGGCCG TTTCCTGGGC GGCCGGCCCC CGTACGGGTA CCGGTTGATC GATCTGGGCC CGCATCCGAA TCCGTCCAAG GCTGCGGACG GTCGGCAGCT CCACGGCCTC GCGCTGGACG AGGTGGCCGC ACCCGTCGTG GTTCGGATCT TCGCCGAGTT CCTCCGCGGT CACGGCATCT TCGCGATCGC GGAAGGGCTC ACCCGGGACG GCATCTCCAG TCCTTCCGCC CACGACCCTG CCCGCAACAG CCACCGCAGC GGGAAGGCAT GGTCGAAAGG GGCTGTCCGC GCGATCCTGA CCAACCCCCG CTACACCGGC CGGCAGGTCT GGAACCGCCA GCGCAAGGAC GAGGTGCTCC TCGACGTCGA GGACGTCAGC CTCGGACACA CCACCAAACT GCGATGGAAC CCCGAAAACA CGTGGATCTG GTCAGAGGAG ACGGTCCACC CGGAGATCAT CGACATCGAG ACCTTCACAC AGGCGCAGGA ACTTCTCGCG GGCCGTGGAC GAGGAGCGGG TGACCAGAAG ACCCCCCGGA CGCGCCAGCC CTACGCCCTG CGCGGGGCGG TCCATTGCGG TATCTGTAAT CGCAAGATGC AGGGACACAC CGTCCGCCGC GCCACCTACT ACCGGTGCCG CTACCCGCAG GAATACGCCC TGGCGAACAC GGTCACCCAT CCGGCGAACG TCTACGTCCG CGAGGACGTC CTCGTCCCGG CACTCGACGG CTGGCTCGCG GACACCCTCA CACCGCCCCG GCTGGCCGAG ACCCTGGACG CCATGGTGGC CGCCCAGGCG AGTCCATCCG TCGACGACCT GGCCGCGCAA CGAGCCCGCC AGACGATCGA GGAGAGCAAC GCCAAGCTCA CGAAATACCG GGCGGCACTG GACGCAGGAG CGGACCCGGC AGTCGTGACC GGATGGATCG CTCAGGTACA GGCTGAGAAG ACCGCTGCCG AGCGGGACCT TCGCGAAGCG CAGGAGAGCG ACGTACGGCA GCTGACACGC GACGAGATCA GTAGCATGGT GGAGTCACTC GGCGAGATCG CCAGCGCCCT AGCGGAGGCG GAGCCCGTCG AGAAGACGGA TCTGTACCGA TCGCTCCAAC TACGCCTGAC TTACCATCCC ACAACCAACA CGGTAAGGGC CGACATGAAG ATCGACACAA GTTACCGTGG GGTAATGGAT CGTGTCCGAG GGGGGACTTG A
|
Protein sequence | MIRFAFEGRC STEDQQDPES SRAWQITRAK ALIEPHGGEI VTEYFDAGKS RSIPWQRRPM ANALLQALKD PQRGFEAVVI GEPQRAFYGN QFGLVFPLFS HYQVPLWVPE VGGPIDPDNE AHDLIMSVFG GMSKGERNRV KIRVRAAMAA QAKVEGRFLG GRPPYGYRLI DLGPHPNPSK AADGRQLHGL ALDEVAAPVV VRIFAEFLRG HGIFAIAEGL TRDGISSPSA HDPARNSHRS GKAWSKGAVR AILTNPRYTG RQVWNRQRKD EVLLDVEDVS LGHTTKLRWN PENTWIWSEE TVHPEIIDIE TFTQAQELLA GRGRGAGDQK TPRTRQPYAL RGAVHCGICN RKMQGHTVRR ATYYRCRYPQ EYALANTVTH PANVYVREDV LVPALDGWLA DTLTPPRLAE TLDAMVAAQA SPSVDDLAAQ RARQTIEESN AKLTKYRAAL DAGADPAVVT GWIAQVQAEK TAAERDLREA QESDVRQLTR DEISSMVESL GEIASALAEA EPVEKTDLYR SLQLRLTYHP TTNTVRADMK IDTSYRGVMD RVRGGT
|
| |