Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2010 |
Symbol | |
ID | 5670411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2415955 |
End bp | 2417340 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240931 |
Product | hypothetical protein |
Protein accession | YP_001506353 |
Protein GI | 158313845 |
COG category | [I] Lipid transport and metabolism [S] Function unknown |
COG ID | [COG1946] Acyl-CoA thioesterase [COG2343] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00189] acyl-CoA thioesterase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.347959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.266903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTCG ACTCCGGATT CCTCGTCCCA ACCCTGGGCA CAGTGTTAGC TTCCTGTGAC GTGCAGCCTG TAACGGCCGA CGCGACGTCG GCAAGCGATC CGGCCTTAAC TCGGCAGCCA TGCCCCTACC CGGCCCGAGC GTGGTGGGGT GACACCCTGG TCGCGGAGTC GACAGCCGCC GTCCGGGTGC AGGAACCGGG GCAGGTCCCC GTCCTTTACT TTCCACGCTC CGACATCCGG TTCGACTCCT TCGGAGACGA GGGCCGGACG GTGTCGTGCC CGGTGAAGGG CACGGCGCGC CTGTGGTCCC TCGAGGGGAA GGTCGACGCC GACTCCGTCG ACTGGCACGA CCCCGCCAGC ACGACGACGG TCGACGGCCG GGACGCGCTG TGGGCCTTCA CCCAGCCGGC TTCCGGCCTG GAGTGGCTGA CCGACTTCGC CGCCTTCGAC CACGACCGGG TCCGAGTGGA GATCGTCGAC GAGCTCCCCG GCGAGCAGCC CCGCGACAGC ACGATCAAAC GCTTCCCCAC CTGGGGCGAC GCCTCAGATC TCATCGACAT CATGAATGTC CGCCAGGCCG GCGACCGCAG ATACGTCAGC GTGGCCCTTG CTGATCCCCG GCGGCCAGTC GTCGAAGGCA GCCAGATGCT CGGGCAGGCT GTGGTGGCCG GCGGCCGCCA CGCGCCCGGG CGGCGGGTTG TCTCGGCGCA CATGGTGTTC TACCGGGCAG CGGACGCGCG CGAACCTCTC GAGTTCGAGG TGGACGAGCT CAGCTCCGGC CGGAGCTTCA CAACGCTGGC AGTACACGTC TCCCAGCGCG GAAAGCGGCG CGCCAGCGGC ACCCTGCTGC TCGATGTCAC CGCACCGGAC GTCGTCCGCC ACTCCGCGGC CCCGCCGCCA TCCGCGGGCC CCTACGACAG CAAGCCCTAC GACATGTCCG TGACCGGACG CGACATCCGC ATGGTCGACG CGGCGTACAC CGACGACCCG GCCGCGCCCG TGGGTCCTCC GGTCATCGGC ACATGGGTCC GGTTCCGCAC GGTGCCGGAC GACCCGTGCC TACAGGCCGG GCTGCTCGCC CAGTTCACCG GGCACATCTC CATCGCCGCC GCCCTGCGCC CGCATGAGGG GGTCGGCCAG AACCAGGCCC ATCGCACGCT GTCGATGGGG ATCAACGCGA TTGGCATCTC CTTTCACTCC AATGTCCGGG CCGACCACTG GATGCGCTAT CATCATCTGT CGACCTTCGC CGGCGACGGA ATGACCCATT CCGAATGCCG GGTCTACGAC GAGGCCGATG CGCTGATCTC GTCGTTCACC GTCGACGCCA TGCTGCGGGG CTTCAGGAAT GACGGGCGCG CGGTCGACGA CAGGACCGCG CTGTGA
|
Protein sequence | MPLDSGFLVP TLGTVLASCD VQPVTADATS ASDPALTRQP CPYPARAWWG DTLVAESTAA VRVQEPGQVP VLYFPRSDIR FDSFGDEGRT VSCPVKGTAR LWSLEGKVDA DSVDWHDPAS TTTVDGRDAL WAFTQPASGL EWLTDFAAFD HDRVRVEIVD ELPGEQPRDS TIKRFPTWGD ASDLIDIMNV RQAGDRRYVS VALADPRRPV VEGSQMLGQA VVAGGRHAPG RRVVSAHMVF YRAADAREPL EFEVDELSSG RSFTTLAVHV SQRGKRRASG TLLLDVTAPD VVRHSAAPPP SAGPYDSKPY DMSVTGRDIR MVDAAYTDDP AAPVGPPVIG TWVRFRTVPD DPCLQAGLLA QFTGHISIAA ALRPHEGVGQ NQAHRTLSMG INAIGISFHS NVRADHWMRY HHLSTFAGDG MTHSECRVYD EADALISSFT VDAMLRGFRN DGRAVDDRTA L
|
| |