Gene Francci3_3723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3723 
Symbol 
ID3903824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4457247 
End bp4458416 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content75% 
IMG OID637881049 
Producthomoserine kinase 
Protein accessionYP_482804 
Protein GI86742404 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0083] Homoserine kinase 
TIGRFAM ID[TIGR00191] homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.422816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.66686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACTC GGTCCTCCCC GCGCCCAGGC GGGACGGCGG CCTTCGCCGA GCGGGTCGGC 
CGGTCCGGGC CGGTCACCGC CGCCGACGGC GTGTCGCGGC GGGTCCGGGT CCGGGTGCCG
GCGACCAGCG CGAATCTCGG CCCCGGCTTT GACACGTTCG GCCTGGCATT GGGCCTGTAC
GACGAGGTCG ATGTCGAGAC GACCCGATCC GGTCTCACCG TTGACGTCGT CGGCACGGAC
GTCGTCGCCC GGGACGAGAC GCACCTCGTG GTCCGGGCGA TCCGGGCGAC CTTCGACACC
CTCGGCCGGG CGCAGCCCGG TCTCGCGTTG CACTGCGTCA ACCGGATCCC GCACGGCCGC
GGGCTGGGAT CATCGGCCGC GGCGATCGTC GCCGGGGTCG TCGCCGCCGC CGCGCTCGCG
CAGCCGGACC TCGGGCCCGC CTTCGAGACC GATGCCGCCT TTGACGCCGA TGCCGCCTTT
GACGCCGATA CCGCCTTTGA CGCCGGCGCG GACTCCGTGT CCGCGTCCGA TGACGCACCC
GAGCCGGACC CGGTGTCCAG AGCGGCGTCC GGAGCCGGCT TCTTCGGGCC CGCGGGCATG
CTGCGGCTCG CCAGCGCCAT CGAGGGGCAC CCGGACAACG TCGCCGCGGC GCTGAGCGGG
GGCTTCACCG TTGCGTGGCA GGACGGCGAC GGTGCCCGGT CCCTGCGCGT CGATCCCTTC
GCCGAGCTGA GACCCGTGAT CTTCATCCCC GCGATACGGC AGTCGACCGA GCAGTCGCGC
GGGGCGCTGC CGAGCGGCGT TCCGTTCGCC GACGCCGCCC GCAACCTCGG TCGGGCCGCG
CTGCTGGCGT TGACGATGTC GGCCGCCGAG CCGGCCGCCG GGCAGCCGTC CCGACGTGCC
GAGGCACTCC TGCGTGCCAC CGAGGATCTG GTGCATCAGC CCTACCGGTT CCCCGGCGTC
CCGGCGTCGG CGGACCTCGT CGGTCGGCTG CGCGGCGGGG GGATCGCGGC GGCCCTGTCC
GGTTCGGGGC CATCCGTCAT CGCCCTCGCC GTCGGTGGTG AACAGGCCGC GGCGGCGGTG
GACGTGGCGG GTGCCGGGTT CTCCGTCGCC CCGCTGCCGG TGGACAGACA CGGTGCACGC
GTCACCCGAT GTCGATCCGC AGCGCCATGA
 
Protein sequence
MTTRSSPRPG GTAAFAERVG RSGPVTAADG VSRRVRVRVP ATSANLGPGF DTFGLALGLY 
DEVDVETTRS GLTVDVVGTD VVARDETHLV VRAIRATFDT LGRAQPGLAL HCVNRIPHGR
GLGSSAAAIV AGVVAAAALA QPDLGPAFET DAAFDADAAF DADTAFDAGA DSVSASDDAP
EPDPVSRAAS GAGFFGPAGM LRLASAIEGH PDNVAAALSG GFTVAWQDGD GARSLRVDPF
AELRPVIFIP AIRQSTEQSR GALPSGVPFA DAARNLGRAA LLALTMSAAE PAAGQPSRRA
EALLRATEDL VHQPYRFPGV PASADLVGRL RGGGIAAALS GSGPSVIALA VGGEQAAAAV
DVAGAGFSVA PLPVDRHGAR VTRCRSAAP