Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1979 |
Symbol | |
ID | 5670380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2378500 |
End bp | 2379699 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240900 |
Product | glycosyl hydrolase 53 protein |
Protein accession | YP_001506322 |
Protein GI | 158313814 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.861107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGGG TCGGCGTGAC CGATTACGAC GCCGGGGAAC GGCGACGCAC CGCGAACAGC GCGGCAACCG CGTTCGTGTT CGGGGCGCGG ACGCATCGGG CGGGTTCGAA GGCGATCATG CGTCGCACGC CGAGCATGCT AAGGCTGCTG TCGATGGCGT TGCTCGCGAT GGTGGCGCTC GCGGGCTGCG TTACCCCCAT CGGGGGTGGA GGCGGCCCCG CTCCGACGAC CAGTGCGGCC CCCAGCCCCG CCGGCCCGAC GGGCTCGCCG TCCGCCGGCC CCACCCAGGC TCCGGACCCG AGCTCGGTTC CGAACCCCGG CGTCACCCCG GTGCCGACCC TCACCTCGGC GCCGGTGACC GCTGCCCCGG TGACCACTGC CCCGGTGACC ACTGCGCCGG GGGCCACGCC GAGCGCCGCG CCAGTGCCCA CGACACCCGG GTCGACTGAT CCGCCTATCG GGACGGTTGC GAAGGGCGCG AGCACCTGGT ACTTCGACAA GATCGCTCCG TCGATGACGG AGGCGGGAGT CTCGTGGTTC TACACCTGGG GAGCCGCGCC GGAGCGGATC GCGGCGCCGG CGGGAGTCGA GTTTGTCCCG ATGATCTGGG GGCCGGGCTC GGTCACTCCG CAGACCCTCG CGACGGTGAA GGCCAACGGC CGCACCCTGC TCGGCTTCAA CGAGCCTGAC CTCCGCGGCC AGGCGGACAT GCCGGTTCAG ACCGCTCTCG ATCTGTGGCC GCAGCTGGAA GCCACCGGAA TGCGTCTGGG TAGCCCGGCG CCCGCCGCCG GCGCCGCGGA CCCGAACAGC TGGTTCGGCC AGTTCATGGC CGGCGCGGCC CAGCGGGGCT ACAAGGTCGA CTTCATCGCG CTGCACTGGT ACGGCAGCGA CTTCGACCCC ACCCGGGCGA CCGGTCAGCT CCGTGCCTAC ATCCAGGACG TGTACGACCG ATACCACCTG CCGATCTGGC TGACCGAGTA CAGCCTGATG AACTTCTCAA CCTCACCCGC GACGGTCCCG AGCGCTGAGG GTCAGGCGGC GTTCGTGACT GCCTCCACCG CGATGCTCGA GAGCCTTCCG TTCGTTGAGC GCTACGCCTG GTTCGCGTTT CCCGCCAATC CCGACAGTCG GACCGGCCTG TATGACGAGT CAGGCCAGCC GACCCCGGCT GGTGTCGCCT ACCAGGCGGC CGGCCGCTGA
|
Protein sequence | MRRVGVTDYD AGERRRTANS AATAFVFGAR THRAGSKAIM RRTPSMLRLL SMALLAMVAL AGCVTPIGGG GGPAPTTSAA PSPAGPTGSP SAGPTQAPDP SSVPNPGVTP VPTLTSAPVT AAPVTTAPVT TAPGATPSAA PVPTTPGSTD PPIGTVAKGA STWYFDKIAP SMTEAGVSWF YTWGAAPERI AAPAGVEFVP MIWGPGSVTP QTLATVKANG RTLLGFNEPD LRGQADMPVQ TALDLWPQLE ATGMRLGSPA PAAGAADPNS WFGQFMAGAA QRGYKVDFIA LHWYGSDFDP TRATGQLRAY IQDVYDRYHL PIWLTEYSLM NFSTSPATVP SAEGQAAFVT ASTAMLESLP FVERYAWFAF PANPDSRTGL YDESGQPTPA GVAYQAAGR
|
| |