Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1998 |
Symbol | |
ID | 5670399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2399586 |
End bp | 2402519 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240919 |
Product | CoA-binding domain-containing protein |
Protein accession | YP_001506341 |
Protein GI | 158313833 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0769955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTCGGC GACCGGCTCG TCGGACATCC CCCCGCGGGC ATCCGTCGGC CGCCACCTGG TCTGCGCCAT TGTGCAATCC AATGCCGGAC GGTTACCGAC ACGGGGTCGA GCGACGGGCC CGCGGCGGAC GACGGAGGCG GCGCGGTCCG GCGCGGAATA GCGTCCACGG GGTGACATCT CGCTACCCGG CGCACTGGGC GGCCGACGGC GTTCTCGCCG ACGGCGCGCC CGTTCAGCTG CGCCCCGTGC TGGGCACCGA CGCGCCGGGT CTGGCCGAGC TGCGGTCCGG CCTGTCCGCG GCGGACGTCG CGCGCCTGCC CGCGCGGTGG GCGCGGCGCT CCCCGGAAGA GCTCGCCGGG CACCTGACCG CGGCCGGCGA CCCGGCCGCC GCCGGTCACC CCACCATCAC CGGTCACCCC ACCACCGGTG ACAGCGGGCG GCTCGCCGTC GCGGCCGTGC TGCGCGGACG TCTGGTCGGC ACGGCGGACT ACGAACGGAT CGCCGGCTCC GACGACGCGG TGGTCGCCCT GGTCGTCGAG GCTGCGCACC GCGGCCGTGG GCTCGGACTG CTGCTGCTCG AACACCTGAT CGCCGCCGCC CGCGAGCGCG GGGTGAGCCA CCTCGTCGCC GATCTGCGCG CGGGCGACGA CCGGGCGCTG CGGGTCTTCC ACGCCGCGGG CTTCGCCGGC GCCGAGACCC GCCCGCGCAC CCCCCGGCAG GACGGCGCTG GTCAGGACGG CGCTGGTGGC GTGCGGGTGG TGTTCCCGAC GGCACAGACT CCGCGCACCC GGGGCATCTC CCGGGCGCTG GAACAGCGGG CGGAGGCACG GAGCATCGCG CGGCTGCTCA CACCGCGCGC GGTCGCGGTC GTCGGCGCGA GCCGGCAGCC CGGCAGCGCC GGCCACGAGG TCTTCCGCCG GCTGCTGGCC AGCGATTTCC ATGGCCCGGT CTACCCGGTC AACCCGGCGG CGCGCCAGGT CGCCTCGGTC TACGCCTACC CGGACGTCCG CGAGATCCCG GACGCGGTCG ACCTCGCCGT GATCGCCGTC CCGGCGCCGG CCGTGGCCGA CGCGGTGCGG GCCTGCGCGG AGAAGGACAT CCGTGGCCTG ATCGTCGTCT CGGCCGGGTT CGCGGAGGCC GGGCCCGACG GGCGGGCCCG GCTCGCCGAG GTCACCCGGC TGGCCCGGGA GTCCGGCATG CGGCTGATCG GCCCGAACGC GATGGGCGTG ATCAACACGG ACCCGGCCGT CCGCCTGCAC GCCACCTTCG CGGCCGGCGA CCCGCCGGTG GGAAGGGTCG GCGCGTTCAC CCAGTCGGGG GCGCTGGCCG GGACGTTCCT CACCGAGGCG TCGCGGCGCG CGATCGGCCT GTCCACGTTC GTCTCCACCG GTGACCGCTC GGACGTCTCG GCCAACGACG TGCTGCAGTA CTGGCAGTCG GACCCGCACA CCGATGTGAT CATGCTGCAC CTGCAGGGGT TCGGCAACCC GCGGAAGTTC GCCCGGATCG CCCGGCGGGT GGGCCGACGC AAGCCCGTGA TCGCGCTGAA GAGCGGGCGC AGCGCCGCCG ACCCGGCCCT GGACGCCCTG TTCACCAGCG CCGGGGTCAT CCGGGTGGAC ACGTTGAGCC AACTGTTCGA CCTGGCCGCG CTGCTGGCGT CCCAGCCGCT GCCCGCCGGG CGGCGCGTCG GCGTCGTGGG GACGTCCAGC GCGCTCGCCG CCCTGGCGAC GGACGCCTGC CGGACGGCAG GCCTGGAGGT ACCGCCCTTC TCCACCGCCA CGGCGGAGGC GTTGAGCGAC ACGCTCGGCC GCCCGGAGCC GGCCAACCCG GTGGACCTGG GCGCGATGGC CGCGCCCGAA CGGTTCGAGC GCGCGCTGCG CGCGGTCGCC GCCAGCGCGG ACGTCGACGC CGTGCTGGCG CTGATCACCC CGCACCCGGC CGTCGAGGAG CTCGCGCGGG CCGTGCGGGC CGTGGCGGGC TCCGGCCGGG TGCCCGTGGT GGCCTCCTAC CTCGGGTACG ACGGGATGCC GTCCGCGCTG GCCGCCCCGG GCGACGGCAT CGTGACGCCC GCACCCGGCT CCGTGCCGTC GTTCGCCTCC CCGGAGTCGG CCGCGCTCGC GCTCGCCCGG GCGGCGGGCC ACGCGGCGTG GCGCAGCCGC GAGCAGGGCG CCGTCCCCAC TCTCGACCGG CTCGACCTGG ACCGCGCGCG CCGCGCGGCG GCCGCCGGCC CGACGGACGG GACGTGGCTG CCCCAGGAGC TGGTCGGCGA CATCCTGGGC GGGGTGGGGC TGGCGGTCTG GCCCAGCGAG CCGGTGACGA GCGCCGCCCA GGCCCTGGAC ACGGCCGAAC GGCTCGGCTG GCCCGTCGCC CTGAAGATCG CCGACGAACG CTTCCGTGGG CGGCTGGACG TCGGAGCCGT CCGGCTGGGC GTCGAGGGGC CCGGCGCGCT CGCGGAGGCC TGGCGCACGA TCCGCGCCGC GGTCGGGCCG GGCGACATGG TCGTCCAACC GATGGCGCCG GCCGGGGTGT CGACCGTGAT CCGGATGACC CAGGACCCGG CGATCGGGCC GCTGCTGTCG CTGCGCCTCG GCGGGGCCGT CGCGGACCTG TTGGTCGACC CGCTGGCCCG GGCGCTGCCG ATCACCGACC GGGACGCCGC CGAGATGGTG CGGGGTATCC GCGGCGCGGT GCTGCTGGTC GGCGGCGCCG GCACCCCGGC GGCGGACACG GCCGCCCTGG AGGACGTGCT GCACCGGCTG GCCCGCCTCG CCGAGGAGGT GCCGGCGGTC GCCGAGGTGC TCCTGGATCC GGTGCTCGTC GGCCGGCCCG GCGTGGTCCT ACTGCATGCC GGCGTCCGCC TGCTCCCGCC GGGAACCGAT CCCGAGTCAC TGCCCCGGCG GATGACGGGC TCCGGCGTCG AGTACTTCCG CTAG
|
Protein sequence | MCRRPARRTS PRGHPSAATW SAPLCNPMPD GYRHGVERRA RGGRRRRRGP ARNSVHGVTS RYPAHWAADG VLADGAPVQL RPVLGTDAPG LAELRSGLSA ADVARLPARW ARRSPEELAG HLTAAGDPAA AGHPTITGHP TTGDSGRLAV AAVLRGRLVG TADYERIAGS DDAVVALVVE AAHRGRGLGL LLLEHLIAAA RERGVSHLVA DLRAGDDRAL RVFHAAGFAG AETRPRTPRQ DGAGQDGAGG VRVVFPTAQT PRTRGISRAL EQRAEARSIA RLLTPRAVAV VGASRQPGSA GHEVFRRLLA SDFHGPVYPV NPAARQVASV YAYPDVREIP DAVDLAVIAV PAPAVADAVR ACAEKDIRGL IVVSAGFAEA GPDGRARLAE VTRLARESGM RLIGPNAMGV INTDPAVRLH ATFAAGDPPV GRVGAFTQSG ALAGTFLTEA SRRAIGLSTF VSTGDRSDVS ANDVLQYWQS DPHTDVIMLH LQGFGNPRKF ARIARRVGRR KPVIALKSGR SAADPALDAL FTSAGVIRVD TLSQLFDLAA LLASQPLPAG RRVGVVGTSS ALAALATDAC RTAGLEVPPF STATAEALSD TLGRPEPANP VDLGAMAAPE RFERALRAVA ASADVDAVLA LITPHPAVEE LARAVRAVAG SGRVPVVASY LGYDGMPSAL AAPGDGIVTP APGSVPSFAS PESAALALAR AAGHAAWRSR EQGAVPTLDR LDLDRARRAA AAGPTDGTWL PQELVGDILG GVGLAVWPSE PVTSAAQALD TAERLGWPVA LKIADERFRG RLDVGAVRLG VEGPGALAEA WRTIRAAVGP GDMVVQPMAP AGVSTVIRMT QDPAIGPLLS LRLGGAVADL LVDPLARALP ITDRDAAEMV RGIRGAVLLV GGAGTPAADT AALEDVLHRL ARLAEEVPAV AEVLLDPVLV GRPGVVLLHA GVRLLPPGTD PESLPRRMTG SGVEYFR
|
| |