Gene Information |
Locus tag | Franean1_3021 |
Symbol | |
ID | 5671403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3556376 |
End bp | 3557491 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241923 |
Product | hypothetical protein |
Protein accession | YP_001507343 |
Protein GI | 158314835 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGGCAG GCTTCGCGAC CTGCGCCGAC GGCGCGGTCT CGCTGCTGAC CCGCGTCTAC GACGGCGGCG CCGCCGAGGT CTCGCAGGTG GAGGGCGCGT TGCGGGCCCT GCGTGAGCTC GCCGGGCCGC GCCGGTTCCT GCTCGTCGGC GACAGCAAGC TCGTCTCCTA CACGAACCTG ACCGCGATCG ACGCGGCCGG CGCCACCTTC GTCGCCCCGG CCCCGCGGAC CATCGTCGGG CCGGCCGCGC TCGCGGCGCA CGACCCGGCC ACCGCGACGA TCGTCGACTG GGCGCCCCAG CGCGAGAAGG ACAAGCTGTT CCACCAGCGC GACGTCCACC GGGTCGTCGA GGGATCCACG ACCCTGCGCG GCCCGAAGGC CACCGACCCG CCGTTCACGA CGCGCACCGT GTACTCCCAC TCGGCCCGCC GTGCCGCGGC GTCCGCGGCC AGCCGGCAGA AACAGATCGA CAAGGCACGC GCCGCGCTCG TCGTGCTGCA CCGCAACCTC GGCACCCACT ACTACCGCGA CGAGGCCGCC GTCCACGCCC GCGTCGAGAA GATCACCAGG GAGTGTCGGG TCGGGGCGTG GCTGCGCACC CACGTCGACA CCAACCCCGA CACCGGCAAA CCGCTGCTGA CCTGGTACTT CGACGAGGCG GCCCTGGACC TGGCGGCGAA CGCCGACGGC TGGTTCGCGC TCCTGACAAA CCAGAGCATC GAGGAGAAGG ACGCCGCCGG GGTCTTCGTC GACTACAAGG GCCAGGAAGC CTCCGAACGG CGCAACAGCG CGTTCAAGGG CCCCCTCGCG GTCAACCCGT TCTACCTGGA GAACAACCAG CGGATCCACG GGCTCCTGCA CGTCGTCGGC CTGGCGCTGC TGCTGTTCTC GCTGATCGAG CGCGAGGCCC GCCGCGCCGC CGGCCCGACC GGGACGGTCG CCGGCCTCTA CGCCCGCCGC CCGGCCAAGC CCACCAGCCG CCTCATCCTC GAAGCCCTCG CCGGCCTGCG CCTCGTGCCC GCTCACGACG GCCAGCCCGC CTACATCCCC CGGCCCACCC CGCTCCAACA GCGCGTGCTC GACCTCCTCG GAGTCGACCC GACCAAACCC CCGTGA
|
Protein sequence | MQAGFATCAD GAVSLLTRVY DGGAAEVSQV EGALRALREL AGPRRFLLVG DSKLVSYTNL TAIDAAGATF VAPAPRTIVG PAALAAHDPA TATIVDWAPQ REKDKLFHQR DVHRVVEGST TLRGPKATDP PFTTRTVYSH SARRAAASAA SRQKQIDKAR AALVVLHRNL GTHYYRDEAA VHARVEKITR ECRVGAWLRT HVDTNPDTGK PLLTWYFDEA ALDLAANADG WFALLTNQSI EEKDAAGVFV DYKGQEASER RNSAFKGPLA VNPFYLENNQ RIHGLLHVVG LALLLFSLIE REARRAAGPT GTVAGLYARR PAKPTSRLIL EALAGLRLVP AHDGQPAYIP RPTPLQQRVL DLLGVDPTKP P
|
| |
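The numeric fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based inclusive coordinates (an assumption that is consistent with the stated Gene Length of 1116 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the Protein Length. The function names are hypothetical and are not part of any database API. Note also that the gene sequence begins with GTG, an alternative start codon under translation table 11 that is translated as methionine, which is why the protein sequence begins with M.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(3556376, 3557491)
    print(length)                  # 1116, matching the Gene Length field
    print(protein_length(length))  # 371, matching the Protein Length field
```

Passing the Gene sequence value to `gc_fraction` gives a figure that can be compared against the GC content field; the function accepts the sequence as printed, with spaces between blocks.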