Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4005 |
Symbol | |
ID | 5672364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4786845 |
End bp | 4787837 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242882 |
Product | cyclase/dehydrase |
Protein accession | YP_001508299 |
Protein GI | 158315791 |
COG category | [S] Function unknown |
COG ID | [COG5637] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.290687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.973287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGA CGACGGAGCA ACCACACGGC CTGGTCGGCG CCCTCCTGCG CAGTCCGGCG AGCAAGAGGC TCGCCGACCA GGCGGGAGAC CTGGCGAAGG CGGGCGGCAG CCGGCTGGCC GGGCGGGTCG GTGAGCGACT CACCTCCAGC ACGGACAAAC TCAACGGTCT CAGCGACTCC GGCGGGCGGA TCTCGTCGCT CGCCGAGGGC GCCGGGAGAC TGGTCGAGGG CCAGTCTCCG CTGAAGGCGG CGGCCGGGAC GATCATGTCG AACGTCAAGG ACAAGGTGAA GGGGGCCCTC GGCGCCGCCA AGGGCAGGGC CGGCGGGAAG AGCGGGCCGC CCAAGGCGAT GAACATCGAG GAGGCCGTGG ACGTCGGCGT CCCGGTGTCG GTCGCCTACG ACCAGTGGAC GCAGTACCCC GAGTTCGCCA AGTTCATGAA AGGCGTCGAG GCGGTCGAGA CCAAGAGCGA GACCGAACAG AACTGGCGGG TCAAGGTCTT CCGCTCCAGG CGCAGCTGGC AGGCAAAGGT CACCGAGCAG ATCCCCGACC GCCGGATCGT CTGGACATCC GAGGGAGCGA AGGGCTCGGT CAAGGGCGCC GTCACCTTCC ACCCGCTCGC CGACGACCTG ACGCGCGTGC TGCTGGCGAT GGAGTACTAC CCCAGCGGGT TCATGGAGAA GACCGGCAGC CTGTGGCGCG CCGGTGGGCG ACGGGCCCGG CTGGATCTGA AACACTTCCG CCGCTTCGTC ATGTTCAGTG GGGAGGCCAC CGGCTCCTGG CGGGGCGAAA TCCGGGACGG CAAGGTCGTT CGCTCCCCGG ACGAGGACCA ATCCGAACCC CGCGAAACCC GGGAGGAGGC GCGGAGCTCG GAAACCACGC AGGCCGCACA GGACAGTGAC GACCAGGGCG CGGCGACGTC CGGGGAATCC GAGGTGCGGG CCGGGTCGAA GTCCGACCAG CCCGCCGGTG CCGACGAGCC TGTCCGAGCC TGA
|
Protein sequence | MTRTTEQPHG LVGALLRSPA SKRLADQAGD LAKAGGSRLA GRVGERLTSS TDKLNGLSDS GGRISSLAEG AGRLVEGQSP LKAAAGTIMS NVKDKVKGAL GAAKGRAGGK SGPPKAMNIE EAVDVGVPVS VAYDQWTQYP EFAKFMKGVE AVETKSETEQ NWRVKVFRSR RSWQAKVTEQ IPDRRIVWTS EGAKGSVKGA VTFHPLADDL TRVLLAMEYY PSGFMEKTGS LWRAGGRRAR LDLKHFRRFV MFSGEATGSW RGEIRDGKVV RSPDEDQSEP RETREEARSS ETTQAAQDSD DQGAATSGES EVRAGSKSDQ PAGADEPVRA
|
| |