Gene Information |
Locus tag | Francci3_3628 |
Symbol | |
ID | 3904184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4332569 |
End bp | 4334218 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880951 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_482709 |
Protein GI | 161353753 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
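
The coordinate and length fields above are mutually consistent. Assuming the Start bp and End bp values are inclusive, 1-based positions, End bp minus Start bp plus 1 gives the gene length, and a 1650 bp CDS holds 550 codons, the last of which is the stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4332569, 4334218)
print(length, protein_length(length))  # 1650 549
```

The strand field does not affect this calculation, since the length is the same whichever strand the gene is read from.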
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.883953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.554164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCGGGG CGGAACGACC CAAGATCCTT GCGGAGGGGC GCCCCGTGCG GCCATACGAT CCCGATTCCC TGCACGTCTA TGACACGACA TTGCGGGACG GTACCCAGCA GGAGGGGCTG TCGCTCTCGG TCGGCGACAA ACTCGCGGTG GCCCGTCACC TGGACGATCT CGGCGTCGGT TTCATCGAGG GCGGCTGGCC GGGATCGAAC CCGAAGGACG CGGAGTTCTT CACCCGGGCG CGCGGCGAAC TGAACCTGAC GACCGCCGTG CTGACCGCCT TCGGGGCCAC CCGGCGGGCG GGCCGCGCCG CCGCGGACGA CCCCCAGGTG GCGGCGCTTC GGGACGCGGG GACCTCGGTG GTCTGCCTGG TCGCCAAGTC GGATCTGCGC CATGTCGAAC GTGCCCTGCG CACCACCCCG GCCGAGAACC TCGCGATGAT CCGGGACACC GTCGCGCATC TGCGCGCGGA GGGAAAGCGG GTGTTCGTTG ACGCCGAGCA CTTCTTCGAC GGCCATCGGG TGTATCCCGA CTACGCTCTT GAGGTCGTCT CCGTTGCGGC CGAGGCCGGT GCCGAGGTCG TCGTGCTGTG CGACACGAAC GGTGGCATGC TCCCGACGCG GATCGGCGCG GTCGTCGCCG ATGTCCTCGC CGCAACCGGC GCCCGGCTCG GTATCCACTG CCACGACGAC GCCGACTGCG CGGTCGCGAA CACCCTGGTG GCGGTGGAGG CCGGCGTCAC CCACGTGCAG GGCACCGCAA ACGGCTACGG CGAGCGGTGC GGGAACGCCA ACCTGCTGAG CGTCATCGCG GGCCTGGAGA CCAAGTTGGA CCGGGCCGTG CTGCCGCCCG GGCGGCTGCG TGAGCTCGTC CGGGTCTCTC ACGCGATCGA CGAGGTCACC AACTCCGTGC CCGACCCGCA CCGGCCGTAC GTCGGGGCCA GCGCCTTCGC GCACAAGGCC GGCCTGCACG CCAGTGCCGT CAAGGTCGAT CCGGACCTGT ACCAGCACAT CGACCCGGCG GTCGTGGGCA GTGACATGAG GATGCTGGTC TCGGAACTCG CCGGTCGGGC GACCATCGAA CTGAAGGGCC GTGAGCTCGG CCTCGACCTG TCGAACCAGC GTGAGGCGCT CGGACGGGTG GTGGACCTGG TCAAGGAGCG GGAGGCAGCC GGATACGTCT ACGAGGCCGC GGAGGCCTCC TTCGAACTCC TGCTGCGCGA CGAGGTGACC GGCCGGCAGC GGTTCTACAC CCTCGAATCC TGGCGCGTCA TCGTCGAGCA GCGTCCGGGC GGCGAGGTGG CCAGCGAGGC GACGGTGAAG CTCACCTCGC ACGGCGAACG CCACGTCGCG ACGGCGGAGG GGAACGGACC GGTCAACGCT CTCGACACGG CACTGCGCAA CGCGCTGGAG AAGGCGTATC CGGGGTTGGC CGATCTGGAC CTGGTGGACT ACAAGGTTCG GATCCTCGAC GGCAAGCAGG GGACCGGCGC GGTGACCCGG GTCCTGCTGG GAACCAGCGA CGGCCGCGAG CGCTGGGACA CGATCGGCGT GGACGAGAAC ATCATCGCAG CCTCCTGGGC GGCCCTGGAG GACGCTGTCG ACTACGGACT GCGGCGGCAG GGGGAGAGCC CGGACCCGGT CGCGCCCTGA
|
Protein sequence | MAGAERPKIL AEGRPVRPYD PDSLHVYDTT LRDGTQQEGL SLSVGDKLAV ARHLDDLGVG FIEGGWPGSN PKDAEFFTRA RGELNLTTAV LTAFGATRRA GRAAADDPQV AALRDAGTSV VCLVAKSDLR HVERALRTTP AENLAMIRDT VAHLRAEGKR VFVDAEHFFD GHRVYPDYAL EVVSVAAEAG AEVVVLCDTN GGMLPTRIGA VVADVLAATG ARLGIHCHDD ADCAVANTLV AVEAGVTHVQ GTANGYGERC GNANLLSVIA GLETKLDRAV LPPGRLRELV RVSHAIDEVT NSVPDPHRPY VGASAFAHKA GLHASAVKVD PDLYQHIDPA VVGSDMRMLV SELAGRATIE LKGRELGLDL SNQREALGRV VDLVKEREAA GYVYEAAEAS FELLLRDEVT GRQRFYTLES WRVIVEQRPG GEVASEATVK LTSHGERHVA TAEGNGPVNA LDTALRNALE KAYPGLADLD LVDYKVRILD GKQGTGAVTR VLLGTSDGRE RWDTIGVDEN IIAASWAALE DAVDYGLRRQ GESPDPVAP
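
The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any computation. The sketch below, using hypothetical helper names, joins the blocks and computes the fraction of G and C bases, which is the kind of calculation behind the GC content field. The demo input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to group the sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("TTGGCCGGGG CGGAACGACC CAAGATCCTT")
print(len(demo), demo[:3])  # 30 TTG
print(round(gc_fraction(demo), 2))
```

The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is an accepted alternative start codon, and an initiating codon is translated as methionine.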