Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3949 |
Symbol | |
ID | 5672310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4721200 |
End bp | 4722546 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242828 |
Product | carbamoyl-phosphate synthase L chain ATP-binding |
Protein accession | YP_001508245 |
Protein GI | 158315737 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.866278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0162238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCAAGA AGGTCCTGAT CGCGAACCGC GGGGAGATCG CGCTGCGGGT CGCGCGCACC TGCCGCGAGC TGGGCATCGC CACCGTGGCC GCCCACTCGT CGGCGGACCG GGACTCCGCC GTCGTCTCCT TCGCCGACGA GTCCGTCCAG ATCGGCCCGG GGCCGTCCCG GGACAGCTAC CTCAACATCC CGGCGGTCGT CGAGGCCGCC CGTATCCGCG GCGCCGACGC CGTCCACCCC GGGTACGGGT TCCTCTCCGA GAATCCCGAC TTCGCCGAGA TCTGCGCGGC CGAGGGCCTG ACGTTCATCG GCCCGCCGGC CGAGGTGATG GCCCGCCTCG GTGACAAGGC TGTCGCCCGT CGGATCATGT CCGAGGCCGG GCTGCCCCTG CTGCCCGGCA GCCGGGACAC TCCCGACCAC CCCGAACAGG CCGCGCAGGT GGCGGCCGAG ATCGGCTACC CGGTCATCAT CAAGGCGGCG GCCGGCGGCG GCGGGCGCGG GATGAGCGTC GTGCACGATC CGGACGGCTT CGAGCGCGCC TACCGGCACA CCCGCTCCAC CGCCCAGGCG GTGTTCGGGG ACGGCCGCCT GTACGTCGAG CGTTACCTGG CCTCGGCACG GCACGTCGAG GTCCAGGTGC TCGCCGATGC GCACGGCGCG GCCGTCCACC TGGGCGCCCG GGACTGCTCG CTGCAGCGCC GGCACCAGAA GCTGGTCGAG GAGACTCCGG CGCCGGCCCT GCCAGCCGAG ATCGTCGAGC CGCTGTGTGC GGCCGCCGTC CGCGGCACCC TGGCCAGCGG GTATGTCGGC GCGGGCACCT TCGAGTTCCT CGTCGACGGC GAGGGCCGGT TCTACTTCAT GGAGGTCAAC TGCCGGCTGC AGGTGGAGCA CCCGGTCACC GAGATGGTCA CCGGGCTCGA CCTGGTCGCC GAGCAGATCC GGATCGCCGC GGGCGAGCCG CTGGGCTACG GCCAGGACGA CGTCGACCCG CGCGGGGTGT CGATCGAGTG CCGGATCAAC GCCGAGGACC CGCGTCGTGA CTTCGCCCCC GCGCCGGGTT CGCTCACCGA GTTCACCGTC CCGGCCGGCC CGTTCGTCCG CGTCGACACG CACGCCGCCC CCGGGTACCG CATCCCGGCC TACTACGACT CACTGGTCGC GAAAGTGGTT GTCTGGGCCC CGGACCGCCC CCGGGCCCTG GCCCGGATGC GCCGCGCCCT CGACGAGCTG CACGCCGACG GGCCCGGCGT CGTCACCACC GCCGGCTTCC TGCGCGAACT GCTCGACCAC CCCCGGTTCA TCGCCGCCGA ACACGACACC GTCCTGATCG AGTCCATGAC CCACTAA
|
Protein sequence | MIKKVLIANR GEIALRVART CRELGIATVA AHSSADRDSA VVSFADESVQ IGPGPSRDSY LNIPAVVEAA RIRGADAVHP GYGFLSENPD FAEICAAEGL TFIGPPAEVM ARLGDKAVAR RIMSEAGLPL LPGSRDTPDH PEQAAQVAAE IGYPVIIKAA AGGGGRGMSV VHDPDGFERA YRHTRSTAQA VFGDGRLYVE RYLASARHVE VQVLADAHGA AVHLGARDCS LQRRHQKLVE ETPAPALPAE IVEPLCAAAV RGTLASGYVG AGTFEFLVDG EGRFYFMEVN CRLQVEHPVT EMVTGLDLVA EQIRIAAGEP LGYGQDDVDP RGVSIECRIN AEDPRRDFAP APGSLTEFTV PAGPFVRVDT HAAPGYRIPA YYDSLVAKVV VWAPDRPRAL ARMRRALDEL HADGPGVVTT AGFLRELLDH PRFIAAEHDT VLIESMTH
|
| |