Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2458 |
Symbol | |
ID | 5670854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2925389 |
End bp | 2926948 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241375 |
Product | hypothetical protein |
Protein accession | YP_001506796 |
Protein GI | 158314288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00115564 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.62035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG GCGGCAGGTA CTGCGGCTGC GGTGATTGTG CGAACTGTGT ACGATGCGTG ACTGTGACGA TGCAGGCTCC TGCTGCCGTC CGGTCCGTTC GCCAGTTTCT CGCTGTCGGC GCGGGCTCGC GGTTCGTCGG GGTCCTCGCG GTGCTGATCG GGGCGCTCCT GCTCGCGGGA ACGGTGAGCC GCGCGGTGGT CGCCCAGCGT CTCGACGCGA CGGAGAGCAT CCGGGAGCGG TCCGCGGCTG CGCTGATCAT CGCCGAGCGG CTACGGGCGG CCCTCGCCCG GGCGGACGCG ACCGCCGCGG CGAACGTCTT CTCGTGGGTC GACCTCGTGC CGTCCCGCCC GTCGTCGCCC GCGGAGCGCC GGGAGAACTA CGACGAGGTG AAGGCCGGCG GCTACTACAC CGGTGACCAC AGCGGGGACG CCGCCGTCTC CGGGCTCGCG GGCGACGACA CGGCTTCCGC CGGGGACGGG CCGGCCGCCC CCAACCAGTA CAGCGCCGCG GTGCGCTCGG CGACGACGGC GCTGCTCGAA CTGCGCGAGG TCGACCCGGG GAAGTGCGAG TCGGCCCGCG CACCCGGTTC GACGTCGCAC TGCGCGATGG ACCGGATCGC GACCCTGATC CCGGAGTACG TGGGGCTGGT CGCCGAGGCG TCGGCGAACA CCCGCGCGGG GAACACGGTC GGCGGCTCCT ACCAGCGCCT GGCGTCCGAC CTGATGGCGA CGAGCATCCT CGGCCAGACG AACCTGCTGG TCACGGCCTA CGGCGAGCGG GTGGACGCCG ACTACCGCCG GGCGACCGGC GGCCACGGCG AGGATCTGCT GCTCCTCGTG CTGGCCTGCG CGCTGGTCGC CCTCGTCGGC GCGCAGGTGT ACGTGTACCG CCGCACGCGG CGGATCCTCA ACGCCGGCCT GCTGGCCGCG ACCGGCGCGC TCGGCGCGTT CGTCGTGGCC TGCCTGGTCC TGCTCGGGGG GCAGCAGAGC CAGCTTTCCG CCGCGCAGCG CGGCGACTTC GTGCCCATGA CCCTGCTGGC CGCCTCGCGC ACCCTGGCCC TCGAGGCGCG GACCTGCGAG TACCTGTCCC TTGCGTCGCT GGGCAACGGC GACGGCGCCG ACGCGTGCTT CCAGCGCGCG GTCGCCGGCC TCGGCTACGG CCTGGACGGC CGCCCCACCC CGACGGCGGG TGCCCTCTCG GCCGCTCTGG CCGAGCTCCC CGCCGAGCGC GCCGAGCTGG CGGCGGAGTT CACCGCCTGG CTGCGGGAGC GCGGCGAGGT GGTCGCGACC CTGCGGCGGC CCCCGCGCCC GACCGCCGGC GCCGGCAGCC CGGACGTCTT CGACGACGTC GTCGGTGTCA CCCTGCGGTC GGAGCACTTC GACCGGTTCA TCACCCGGGT GGACGGGCTG GTGGGCGGCC ATCTCGACGG CTTCAACGGT GAGGTGGACG CCGGGGCCGC CGACGTGCGC CTCCTCGCCG CCGTCCTCCC GATCGCCCTG CTCACCGCGG CGTCCTGCTG CGTGATCGGG ATCAGTGCCC GCGTCCGGGA GTACCTGTGA
|
Protein sequence | MSAGGRYCGC GDCANCVRCV TVTMQAPAAV RSVRQFLAVG AGSRFVGVLA VLIGALLLAG TVSRAVVAQR LDATESIRER SAAALIIAER LRAALARADA TAAANVFSWV DLVPSRPSSP AERRENYDEV KAGGYYTGDH SGDAAVSGLA GDDTASAGDG PAAPNQYSAA VRSATTALLE LREVDPGKCE SARAPGSTSH CAMDRIATLI PEYVGLVAEA SANTRAGNTV GGSYQRLASD LMATSILGQT NLLVTAYGER VDADYRRATG GHGEDLLLLV LACALVALVG AQVYVYRRTR RILNAGLLAA TGALGAFVVA CLVLLGGQQS QLSAAQRGDF VPMTLLAASR TLALEARTCE YLSLASLGNG DGADACFQRA VAGLGYGLDG RPTPTAGALS AALAELPAER AELAAEFTAW LRERGEVVAT LRRPPRPTAG AGSPDVFDDV VGVTLRSEHF DRFITRVDGL VGGHLDGFNG EVDAGAADVR LLAAVLPIAL LTAASCCVIG ISARVREYL
|
| |