Gene Franean1_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1013 
Symbol 
ID5669427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1195305 
End bp1196561 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content72% 
IMG OID641239942 
Productglycine hydroxymethyltransferase 
Protein accessionYP_001505375 
Protein GI158312867 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.239526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTACCC CGTTCTGGGG CCCGGACTTC GACCAGCTGC GGGCCACCGA TCCGGACATC 
GCCGCGGTCG TGGTCGACGA GTTGGAGCGG CTGCGCGGCG GCCTCCAGCT CATCGCGAGC
GAGAACTTCA CCTCGCCGGC GGTGCTCGCG GCGCTCGGCT CAACGCTGTC GAACAAGTAC
GCCGAGGGCT ACCCGGGGCG CCGCTACTAC GGCGGGTGCC AGGTGGTCGA CCGCGCCGAG
GAGATCGGCA TCGCGCGGGC CCGGGAGCTG TTCGGTGCCG AGCACGCGAA CCTGCAGCCG
CACTCGGGCA CGCAGGCGAA CTTCGCCGTC TACGCCGCCC TGCTCACACC GGGGGACACT
GTCCTGGCGA TGTCGCTGCC GCACGGCGGG CACCTGACGC ACGGCAGCCG TGTCAACTTC
TCCGGGCGGT GGTTCGACGT CGTGGCCTAC GGGGTCCGGG AGGACACCGA GCTCATCGAC
TACGACCAGG TCCGTGAGCT GGCCCTGCAA CACCGGCCCA AGATGATCAT TTGCGGGGCG
ACGGCCTACC CCCGCCGCAT CGACTTCGCC GCGTTCCGCT CGATCGCCGA CGAGGTCGGC
GCCTGGCTGA TGGTGGACGC GGCGCACTTC ATCGGGCTGG TCGCCGGCGG CGCGCTGCCG
AGCCCGGTGC CGCACGCCGA CGTCGTCAGC TTCACCACGC ACAAGGTGCT GCGCGGCCCG
CGCGGCGGCA TGATCCTCTG CCGGGAGGAG CTGGCCGCCC GCATCGACAA GGCGGTGTTC
CCGTTCAGCC AGGGCGGGCC GCTGATGCAC GCCGTCGCGG CGAAGGCGGT GGCGCTCAAG
GAGGCCGCGA CCCCCGAGTA CGCCACCTAC GCCCACCAGG TGATCGCGAA CGCGCAGACC
CTCGCCGAGG GGCTGGCGGC CGAGGGTGTC CGGCCGGTGG CGGGCGGCAC CGACACCCAC
CTGACCCTGC TCGACCTGCG GGAGCTCGGC GTGACCGGCC GCGACGCCGA GGCACGCTGC
GACGCGGCGG GCATCACGCT CAACAAGAAC GCCATCCCGT ACGACCCGCA GCCGCCCGCG
ATCTCCTCCG GCATCCGGGT GGGCACTCCC GCGGTGACGA CGCAGGGCAT GCGGGAGGGC
GAAATGAAGG AGATCGCCGG TCTGATCGCC CGTGCGGTGC GTGACCCGTC CGCGGCGGCT
GACGTCTCCG CCGCGGTGTC CGTGCTCGTC GACCGTCACC CGGCCTATCC GAGATAG
 
Protein sequence
MSTPFWGPDF DQLRATDPDI AAVVVDELER LRGGLQLIAS ENFTSPAVLA ALGSTLSNKY 
AEGYPGRRYY GGCQVVDRAE EIGIARAREL FGAEHANLQP HSGTQANFAV YAALLTPGDT
VLAMSLPHGG HLTHGSRVNF SGRWFDVVAY GVREDTELID YDQVRELALQ HRPKMIICGA
TAYPRRIDFA AFRSIADEVG AWLMVDAAHF IGLVAGGALP SPVPHADVVS FTTHKVLRGP
RGGMILCREE LAARIDKAVF PFSQGGPLMH AVAAKAVALK EAATPEYATY AHQVIANAQT
LAEGLAAEGV RPVAGGTDTH LTLLDLRELG VTGRDAEARC DAAGITLNKN AIPYDPQPPA
ISSGIRVGTP AVTTQGMREG EMKEIAGLIA RAVRDPSAAA DVSAAVSVLV DRHPAYPR