Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4204 |
Symbol | |
ID | 3907169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5019607 |
End bp | 5021232 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881532 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_483281 |
Protein GI | 86742881 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGTT ACACCGGCGC CTTCTCTTTG GAAATCCCGG ATTTATGGAA AAACCATGAG AACTGCGCTT TCCGTATTCT TTACGGACTG GTCCAGAGGC CGAACTCGCT CGTGGTGCGC TGGCGTGACA GAGAAATCAC GGCCGAGCTT CTCGCCGAGC TGATCCTGAA TGCTGCAGAA ACCTTCCTGC AATGCGGTGC CGGCCCACGC GAAGCCATTG CCGTCCTTGC ATCGTCCAAT CATCCGGCGA TGCTCGTCTG CCGATATGCG GCACACCTCA TCGGGGCATC CGTCGTCTAC GTCCGGGCCG CCAACCCCCG CACCGACGCC GAAATGCTGT CGCGCCAGGT GCAGTCCCGG ATCCTCGAGG AGGTCGGGGC CCGCGTCCTG GTCGTCGACG CATCACACAT CGACCGGGGA CGAGCTCTCG TGGCGTCCGC CACGACGCTG CTCACCCTCG TGACGGAAAC GTACCCCGCG GTGCCGCTCG ACACAGGTCG TACGCCCCGG CTGCCTGATC TTCCCCCGTA CGACGGCGAC GCGCGTGCTC TGGTCACGTT CACCAGCGGA AGCACGGGAC AACCCAAGAG CTTGTCTCAG TCGTACCGCA CCTGGAACGC GACCGTCCGC GGGTTCTCCG GCCGGACCGA TACCCACCTA CCGTCGCGGA TCCTGGCCGT GACGCCGGTC AGCCACACTG TCGGCTTCAT GGTCGACTCC GTCCTCGCCG CCGGTGGCAG CGCGGTCCTG CATGAAGGCT TTGACGCCGG CACCGTGCTC AGCGATGTCG CCAGGCACCG GATCACCGAT ACCTATCTGG CCGTTCCGCA CCTGTACCGC CTGGTTGAGC ATGAGGACCT TCCCCGCACC GATGTGTCGT CCCTGCGTCG GCTCATCTAC AGCGGCACCC CGGCCGCGCC ACGTCGGATC GCGCAGGCGG TCCCCTGCTT CCGCGACGCC ATCGTCCAGC TCTACGGCAC GACGGAAGCG GGCGGCATCT CCAGCCTCAC GCCGCTGGAC CACCAGGAGC CCGAACTCCT GCCGACGGTC GGGCGGCCCT TCCCCTGGGT GCAGGTCCGC ATGTGTGACC CGGACACCGG CGCTGAGGTG GAGCGAGGCC ACGTGGGGGA GGTGTGGACG TACTCGACAA CAGTGATGGA CGGCTACCTG GAGACCGGCG TCCCCACGCA CAGCACTCTG CGAGACGGCT GGCTGCGCAC GGGTGACCTC GGCTACTGGG ACCAGTACGG CTATCTGCGG CTGGTCGGCC GGGTCGGCCA GGTGATCAAG GCCGGCGGAC AGAAGGTTTA CCCGACGGCC GTCGAGTCGG CGCTCCAGGA GCATCCCGAC GTGCGGCATG CCGTTGTCTT CGGCGTCCAC GACCGGGACC GGATCGAGCA CGTGCACGCC GCAGTCGTCC TGGCACCCGG TTCCTCGGTC ACAGACGAGG AGCTGAGTCG CCATGTGGCC GCCACGCTCG ACTCGGCCCA TGCACCTGCG CACTTCAGCC GATGGGCCGA GATCCCGCTC ACCGCGTACG GGAAACCGGA CCGAGCGTCA CTGCGTTCCC GAGCGGAGCG GGAAGCGCTG GGCGGCGGGA CAGTGCAGAG AGGAAGGGCA CTGTGA
|
Protein sequence | MTSYTGAFSL EIPDLWKNHE NCAFRILYGL VQRPNSLVVR WRDREITAEL LAELILNAAE TFLQCGAGPR EAIAVLASSN HPAMLVCRYA AHLIGASVVY VRAANPRTDA EMLSRQVQSR ILEEVGARVL VVDASHIDRG RALVASATTL LTLVTETYPA VPLDTGRTPR LPDLPPYDGD ARALVTFTSG STGQPKSLSQ SYRTWNATVR GFSGRTDTHL PSRILAVTPV SHTVGFMVDS VLAAGGSAVL HEGFDAGTVL SDVARHRITD TYLAVPHLYR LVEHEDLPRT DVSSLRRLIY SGTPAAPRRI AQAVPCFRDA IVQLYGTTEA GGISSLTPLD HQEPELLPTV GRPFPWVQVR MCDPDTGAEV ERGHVGEVWT YSTTVMDGYL ETGVPTHSTL RDGWLRTGDL GYWDQYGYLR LVGRVGQVIK AGGQKVYPTA VESALQEHPD VRHAVVFGVH DRDRIEHVHA AVVLAPGSSV TDEELSRHVA ATLDSAHAPA HFSRWAEIPL TAYGKPDRAS LRSRAEREAL GGGTVQRGRA L
|
| |