Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5253 |
Symbol | |
ID | 5673587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6314658 |
End bp | 6315725 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244108 |
Product | hypothetical protein |
Protein accession | YP_001509517 |
Protein GI | 158317009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.973786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.278605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACC CGCGCCGCCG CCGGCGCCTG ATCCTCGGCG CGGGCGCGCT GCTCGCCGTG ATCCTCGGGG TGGGCTTCAT CGCTGCCGGA ACGGACGGCG CCGGCACGAA CTCGGCGGAC ACGGCGGCCG TCCCGATGCC GGCGTCCGAG CCCGGTACAC CCGGCGAGAT GAGCACGTCG GCGGACTCCA GCGCGTCGGC GTCGGCGTCG GCGCCGTCGT CGGCGCCCGG TGCGGCCCGG GCGGACGGCA TCGCCGCGGG AGCGCCGTCG ACCGTCCCGG GCGGGCCCGG TGGCACCGGC GGCACCGGCG GCGAGCCCGC CCGGCCCGCC GGCGCCCAGC CGCGGATCGT CCGCAACGGC ACGGCCACGC TGTCCGTGCC GGCCGGTGCC GTGGACAAGG CGGTCCAGGA TCTCTCCGCG GCGGCGCGGG GGCTTGCGGG GTACACCGAG TCCAGCGAGG TCAGCGGCAC TCCGTCGACC ACTGATGACG GCAGCCAGTA CGCCACCGTG ACCCTGCGGG TGCCGAGCGA GTCGTTCGAC GAGCTGCGGT CCGGCCTGAG CCGGATCGGC ACGGTGTCGG CGTCGACGAT GTCCTCGCGC GACGTGACCG GGGAGTACGT CGATCTCGAG GCGCGCAAGC GCGCGCTGGA GGCCTCCCGC ACCGCCTACA CGACGCTGCT CTCCAATGCC ACCACGGTGG GGGAGACGCT GTCGGTGCAG CAGGCCATCG ACGGCGTGCA GATCCAGATC GAGCAGATCG AGGGCCAGCG GATGGTCCTC GCCGACGCCA GCGACCTCGC GACGTTGACG GTGCAGATCG CCGAGGACGG AGCGGACCCC GCACCCGGGC CGGACGATGA CGACTCGGGG CTGGTCGCTG CCGCGCGGAC ATCCTGGAAC CGTTTTGTCC GCGGTATCGA GGAGATCATC GCGCTGCTCG GCCCGCTGGC GCTGGTCGGC CTGGTCGCCG CGTGCGTCTA CGGGGCCGTC CGGATCGCGC GCCGGTGGGG CTGGATCCCG ACGACCCCGG CCCCGCCCGC GCCGCCGCGG GACTCGGCGG GGTCGTAA
|
Protein sequence | MPDPRRRRRL ILGAGALLAV ILGVGFIAAG TDGAGTNSAD TAAVPMPASE PGTPGEMSTS ADSSASASAS APSSAPGAAR ADGIAAGAPS TVPGGPGGTG GTGGEPARPA GAQPRIVRNG TATLSVPAGA VDKAVQDLSA AARGLAGYTE SSEVSGTPST TDDGSQYATV TLRVPSESFD ELRSGLSRIG TVSASTMSSR DVTGEYVDLE ARKRALEASR TAYTTLLSNA TTVGETLSVQ QAIDGVQIQI EQIEGQRMVL ADASDLATLT VQIAEDGADP APGPDDDDSG LVAAARTSWN RFVRGIEEII ALLGPLALVG LVAACVYGAV RIARRWGWIP TTPAPPAPPR DSAGS
|
| |