Gene Information |
Locus tag | Franean1_4071 |
Symbol | |
ID | 5672429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4853713 |
End bp | 4854954 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242947 |
Product | homocitrate synthase |
Protein accession | YP_001508364 |
Protein GI | 158315856 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
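The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4853713
end_bp = 4854954
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both end points.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, 1242 bp corresponds to 414 codons, of which 413 encode amino acids.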
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00164533 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAGCG GCACGTCGTC CTGCCCGCCC GCGCCGCGCC CCGTCCCGCC GGATCACGCG CGGGAGAACG GGTCCGGGCC GGAACCCGGG CCTGGCCCCG GTCCGGTCCG GTTCTGCGAC ACCACCCTGC GTGACGGCGA GCAGACCCCG GGGGTCGCCT TCACCGCCAA GGAGAAGATC GCCATCGCGG TCGCGCTCGA CGCCGCCGGC GTGCACCAGA TCGAGGCCGG GGTGCCGGCG ATGGGGCCGG TCGAGGTGGA TGTCCTGCGC CGCGTCGTCG CGGCGGTCGA GCGGGCCGGC GTGGTCGCCT GGTGCCGCGC GGACCGCCGC GACGTCGACG CCGCCGTCGC CAGCGGGGTC GACGCCGCGC ACCTGACCAT CCCCGCCTCC GACCTTCACC TGCGCACCAA GCTCGGCAAG GACCGGGCCT GGGCGCGGGC CCGCATCCGC GACTGCGTCC TCGACGCCAC CGACCGCGGC CTGCGGGTGA GCGTCGGCTT CGAGGACGCC TCCCGCGCGG ACGACGGCTT CGTCATCGAC CTGGCCGGGG AGCTACGGCG CCTCGGCGTC ACCCGGTTCC GCTGGGCGGA CACGGTCGGG GTGGCCAATC CGATCACCCT GCACACGCGG CTGCGAGCGC TGCTCGACGC CGTCCCCGGG CCGTGGGAGA TCCACGCCCA CGACGACTTC GGGCTGGCGA CCGCCAACAC GATCGCCGCG GTGCAGGCCG GGTTCACCTG GGTGAGCACC ACGGTGGCCG GCCTGGGCGA GCGTGCCGGC AACGCGCCGA CCGAGGAGGT CGCGATGGCG CTGCGGCACC TGCTCGGCCT GCCCGTCGAC CTGGACACCG CCGCGTTCCG CCCGCTGGCG CGGCTGGTCG CCGGCGCGTC GCGCCGGCCG GTTCCCGCCG GCAAGGCCGT GGTCGGCGAC GCCGTGTTCG ACCACGAGTC CGGCATCCAC GTGCACGGCG TGCTGCGCGC CCCGGCGACC TACGAGCCGT TCGACCCGGC GGAGGTCGGC GCGCGCCGGC GGCTGGTGCT GGGCAAGCAC AGCGGCCGCG CCGCCGTGCG GCACGCGATG GACCGGCACG GCATCGACGC GCCCGACGAG GACCTGGAAC CGATCGTCGG CCTGGTCCGC GCGCACGCCA CCGTGTACAA GCAGCCGCTG AGTTCCGACC AGCTGCGGGC GATGGCCCGG CGGGTCGCCA CCCGCCGCGG CGCACGGCCC CGCCGCGGCT GA
Protein sequence | MKSGTSSCPP APRPVPPDHA RENGSGPEPG PGPGPVRFCD TTLRDGEQTP GVAFTAKEKI AIAVALDAAG VHQIEAGVPA MGPVEVDVLR RVVAAVERAG VVAWCRADRR DVDAAVASGV DAAHLTIPAS DLHLRTKLGK DRAWARARIR DCVLDATDRG LRVSVGFEDA SRADDGFVID LAGELRRLGV TRFRWADTVG VANPITLHTR LRALLDAVPG PWEIHAHDDF GLATANTIAA VQAGFTWVST TVAGLGERAG NAPTEEVAMA LRHLLGLPVD LDTAAFRPLA RLVAGASRRP VPAGKAVVGD AVFDHESGIH VHGVLRAPAT YEPFDPAEVG ARRRLVLGKH SGRAAVRHAM DRHGIDAPDE DLEPIVGLVR AHATVYKQPL SSDQLRAMAR RVATRRGARP RRG
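The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. A minimal sketch follows, assuming the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Any trailing partial codon is ignored.
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence, as printed in the record.
prefix = "ATGAAGAGCG GCACGTCGTC"
print(len(clean(prefix)))   # 20
print(translate(prefix))    # MKSGTS
```

The translated prefix matches the start of the listed protein sequence (MKSGTSS...). If the assumptions hold, passing the full gene sequence to `translate` and `gc_fraction` should give values comparable to the Protein sequence and GC content fields.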