Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3723 |
Symbol | |
ID | 3903824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4457247 |
End bp | 4458416 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637881049 |
Product | homoserine kinase |
Protein accession | YP_482804 |
Protein GI | 86742404 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.422816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.66686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACTC GGTCCTCCCC GCGCCCAGGC GGGACGGCGG CCTTCGCCGA GCGGGTCGGC CGGTCCGGGC CGGTCACCGC CGCCGACGGC GTGTCGCGGC GGGTCCGGGT CCGGGTGCCG GCGACCAGCG CGAATCTCGG CCCCGGCTTT GACACGTTCG GCCTGGCATT GGGCCTGTAC GACGAGGTCG ATGTCGAGAC GACCCGATCC GGTCTCACCG TTGACGTCGT CGGCACGGAC GTCGTCGCCC GGGACGAGAC GCACCTCGTG GTCCGGGCGA TCCGGGCGAC CTTCGACACC CTCGGCCGGG CGCAGCCCGG TCTCGCGTTG CACTGCGTCA ACCGGATCCC GCACGGCCGC GGGCTGGGAT CATCGGCCGC GGCGATCGTC GCCGGGGTCG TCGCCGCCGC CGCGCTCGCG CAGCCGGACC TCGGGCCCGC CTTCGAGACC GATGCCGCCT TTGACGCCGA TGCCGCCTTT GACGCCGATA CCGCCTTTGA CGCCGGCGCG GACTCCGTGT CCGCGTCCGA TGACGCACCC GAGCCGGACC CGGTGTCCAG AGCGGCGTCC GGAGCCGGCT TCTTCGGGCC CGCGGGCATG CTGCGGCTCG CCAGCGCCAT CGAGGGGCAC CCGGACAACG TCGCCGCGGC GCTGAGCGGG GGCTTCACCG TTGCGTGGCA GGACGGCGAC GGTGCCCGGT CCCTGCGCGT CGATCCCTTC GCCGAGCTGA GACCCGTGAT CTTCATCCCC GCGATACGGC AGTCGACCGA GCAGTCGCGC GGGGCGCTGC CGAGCGGCGT TCCGTTCGCC GACGCCGCCC GCAACCTCGG TCGGGCCGCG CTGCTGGCGT TGACGATGTC GGCCGCCGAG CCGGCCGCCG GGCAGCCGTC CCGACGTGCC GAGGCACTCC TGCGTGCCAC CGAGGATCTG GTGCATCAGC CCTACCGGTT CCCCGGCGTC CCGGCGTCGG CGGACCTCGT CGGTCGGCTG CGCGGCGGGG GGATCGCGGC GGCCCTGTCC GGTTCGGGGC CATCCGTCAT CGCCCTCGCC GTCGGTGGTG AACAGGCCGC GGCGGCGGTG GACGTGGCGG GTGCCGGGTT CTCCGTCGCC CCGCTGCCGG TGGACAGACA CGGTGCACGC GTCACCCGAT GTCGATCCGC AGCGCCATGA
|
Protein sequence | MTTRSSPRPG GTAAFAERVG RSGPVTAADG VSRRVRVRVP ATSANLGPGF DTFGLALGLY DEVDVETTRS GLTVDVVGTD VVARDETHLV VRAIRATFDT LGRAQPGLAL HCVNRIPHGR GLGSSAAAIV AGVVAAAALA QPDLGPAFET DAAFDADAAF DADTAFDAGA DSVSASDDAP EPDPVSRAAS GAGFFGPAGM LRLASAIEGH PDNVAAALSG GFTVAWQDGD GARSLRVDPF AELRPVIFIP AIRQSTEQSR GALPSGVPFA DAARNLGRAA LLALTMSAAE PAAGQPSRRA EALLRATEDL VHQPYRFPGV PASADLVGRL RGGGIAAALS GSGPSVIALA VGGEQAAAAV DVAGAGFSVA PLPVDRHGAR VTRCRSAAP
|
| |