Gene Franean1_0202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0202 
Symbol 
ID5668627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp249519 
End bp250583 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content71% 
IMG OID641239131 
Productphosphoribosylaminoimidazole-succinocarboxamide synthase 
Protein accessionYP_001504575 
Protein GI158312067 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0152] Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase 
TIGRFAM ID[TIGR00081] phosphoribosylaminoimidazole-succinocarboxamide synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.160911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.75146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCGG CGGCCGCCGG CATCCGCATC CCGACCAGGA CCCCGGGCCG GTCGCCCCGG 
CCCGGGTCCG CCGCCGGGCA ACCGAGCGGG GGCATCACGA AGAGGCACGT TGAGCGGGCG
TCCCGGGCAG GAGGCTACCG GTACGGTCGG ACCATGCCGC TGACACATGA GGAGTTCGCC
GGGCTGACGC ATCTCGGCTC GGGGAAGGTG CGTGAGCTGT TCGCGATCGG GGATGACGCG
GTGCTGCTCG TGGCGAGCGA CCGGATCTCG GCCTTCGACG TCGTGCTGCC CACGGAGATC
CCAGACAAGG GCGCGGTGCT CACCGGGCTC AGCCTGTGGT GGTTCGACCA GCTTGGTGAT
CTCGTCCCGA GCCATGTGAT CAGTTCGAGT GTGGACGAGT ATCCGGCGGA ACTCGCGCCC
TACGCCGAGC AGCTGCGCGG GCGCTCGATG CTGTGCCGCC GGCTCGACAT GGTCCAGATC
GAGTGCGTCG CCCGCGGTTA CCTGACCGGC AGCGGTCTGA AGGACTACCG GCGCTCCGGC
ACCGTCAGCG GCCATCCGCT CCCCGCCGGC CTGGAGGATG GCAGCAGGCT GCCGAACCCG
ATCTACACGC CGTCGACGAA GGCACCGATC GGGGAGCATG ACGAGAACAT CAGCCGGGAC
GACGCGGCCG GCCGGGTCGG CGCGGAGCTG GCGGCCGAGC TCGAGCGGCT CACCCTGCAG
ATCTTCGGGC GGGCCAGCGA CCTGGCCGCC GAGCGCGGGA TCCTGCTCGC CGACACCAAG
TTCGAGTTCG GCCACGACGC GGACGGCGTG CTGCGGCTCG CCGACGAGGT ACTCACCCCG
GACTCGTCCC GGTTCTGGCC GGCGGACGCC TGGACGCCGG GCGGCACGCA GCCGTCCTAT
GACAAGCAGT TCATCCGCGA CTACCTGGTC AGCACGGGGT GGGACCGCAA CCCGCCGGCA
CCGGAGCTGC CCGACGACAT CGTCGAGTCG ACGCGCGCCC GCTATGTCGA GGCCTACGAG
CGGCTGACCG GGATCTCGTT CAAGGATTAC CTGTCCACCG CGTGA
 
Protein sequence
MLPAAAGIRI PTRTPGRSPR PGSAAGQPSG GITKRHVERA SRAGGYRYGR TMPLTHEEFA 
GLTHLGSGKV RELFAIGDDA VLLVASDRIS AFDVVLPTEI PDKGAVLTGL SLWWFDQLGD
LVPSHVISSS VDEYPAELAP YAEQLRGRSM LCRRLDMVQI ECVARGYLTG SGLKDYRRSG
TVSGHPLPAG LEDGSRLPNP IYTPSTKAPI GEHDENISRD DAAGRVGAEL AAELERLTLQ
IFGRASDLAA ERGILLADTK FEFGHDADGV LRLADEVLTP DSSRFWPADA WTPGGTQPSY
DKQFIRDYLV STGWDRNPPA PELPDDIVES TRARYVEAYE RLTGISFKDY LSTA