Gene Franean1_4071 details

Gene Information

Locus tag: Franean1_4071
Symbol:
ID: 5672429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4853713
End bp: 4854954
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 77%
IMG OID: 641242947
Product: homocitrate synthase
Protein accession: YP_001508364
Protein GI: 158315856
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
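
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the database.

```python
# Values copied from the gene record above.
start_bp = 4853713
end_bp = 4854954
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute an amino acid to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)
```

Under these assumptions the computed span and protein length agree with the listed Gene Length and Protein Length.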


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00164533
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGCG GCACGTCGTC CTGCCCGCCC GCGCCGCGCC CCGTCCCGCC GGATCACGCG 
CGGGAGAACG GGTCCGGGCC GGAACCCGGG CCTGGCCCCG GTCCGGTCCG GTTCTGCGAC
ACCACCCTGC GTGACGGCGA GCAGACCCCG GGGGTCGCCT TCACCGCCAA GGAGAAGATC
GCCATCGCGG TCGCGCTCGA CGCCGCCGGC GTGCACCAGA TCGAGGCCGG GGTGCCGGCG
ATGGGGCCGG TCGAGGTGGA TGTCCTGCGC CGCGTCGTCG CGGCGGTCGA GCGGGCCGGC
GTGGTCGCCT GGTGCCGCGC GGACCGCCGC GACGTCGACG CCGCCGTCGC CAGCGGGGTC
GACGCCGCGC ACCTGACCAT CCCCGCCTCC GACCTTCACC TGCGCACCAA GCTCGGCAAG
GACCGGGCCT GGGCGCGGGC CCGCATCCGC GACTGCGTCC TCGACGCCAC CGACCGCGGC
CTGCGGGTGA GCGTCGGCTT CGAGGACGCC TCCCGCGCGG ACGACGGCTT CGTCATCGAC
CTGGCCGGGG AGCTACGGCG CCTCGGCGTC ACCCGGTTCC GCTGGGCGGA CACGGTCGGG
GTGGCCAATC CGATCACCCT GCACACGCGG CTGCGAGCGC TGCTCGACGC CGTCCCCGGG
CCGTGGGAGA TCCACGCCCA CGACGACTTC GGGCTGGCGA CCGCCAACAC GATCGCCGCG
GTGCAGGCCG GGTTCACCTG GGTGAGCACC ACGGTGGCCG GCCTGGGCGA GCGTGCCGGC
AACGCGCCGA CCGAGGAGGT CGCGATGGCG CTGCGGCACC TGCTCGGCCT GCCCGTCGAC
CTGGACACCG CCGCGTTCCG CCCGCTGGCG CGGCTGGTCG CCGGCGCGTC GCGCCGGCCG
GTTCCCGCCG GCAAGGCCGT GGTCGGCGAC GCCGTGTTCG ACCACGAGTC CGGCATCCAC
GTGCACGGCG TGCTGCGCGC CCCGGCGACC TACGAGCCGT TCGACCCGGC GGAGGTCGGC
GCGCGCCGGC GGCTGGTGCT GGGCAAGCAC AGCGGCCGCG CCGCCGTGCG GCACGCGATG
GACCGGCACG GCATCGACGC GCCCGACGAG GACCTGGAAC CGATCGTCGG CCTGGTCCGC
GCGCACGCCA CCGTGTACAA GCAGCCGCTG AGTTCCGACC AGCTGCGGGC GATGGCCCGG
CGGGTCGCCA CCCGCCGCGG CGCACGGCCC CGCCGCGGCT GA
 
Protein sequence
MKSGTSSCPP APRPVPPDHA RENGSGPEPG PGPGPVRFCD TTLRDGEQTP GVAFTAKEKI 
AIAVALDAAG VHQIEAGVPA MGPVEVDVLR RVVAAVERAG VVAWCRADRR DVDAAVASGV
DAAHLTIPAS DLHLRTKLGK DRAWARARIR DCVLDATDRG LRVSVGFEDA SRADDGFVID
LAGELRRLGV TRFRWADTVG VANPITLHTR LRALLDAVPG PWEIHAHDDF GLATANTIAA
VQAGFTWVST TVAGLGERAG NAPTEEVAMA LRHLLGLPVD LDTAAFRPLA RLVAGASRRP
VPAGKAVVGD AVFDHESGIH VHGVLRAPAT YEPFDPAEVG ARRRLVLGKH SGRAAVRHAM
DRHGIDAPDE DLEPIVGLVR AHATVYKQPL SSDQLRAMAR RVATRRGARP RRG
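
The gene and protein sequences above are related through translation table 11. As a minimal sketch, the code below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. The codon-to-amino-acid string is the standard assignment used by NCBI table 11 (start-codon alternatives are ignored here), and the helper names `strip_blocks` and `translate` are hypothetical, introduced only for illustration.

```python
from itertools import product

# Codon table: bases in TCAG order, amino acids in the matching
# order (standard assignments, as used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used in the 10-character block layout."""
    return "".join(text.split())


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein,
# copied from the record above.
gene_start = strip_blocks(
    "ATGAAGAGCG GCACGTCGTC CTGCCCGCCC GCGCCGCGCC CCGTCCCGCC GGATCACGCG"
)
protein_start = strip_blocks("MKSGTSSCPP APRPVPPDHA")

translated = translate(gene_start)
print(translated)  # MKSGTSSCPPAPRPVPPDHA
print(translated == protein_start)
```

The same two helpers could be applied to the full sequence blocks, provided the whitespace is stripped first.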