Gene Francci3_3717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3717 
Symbol 
ID3903818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4449309 
End bp4450565 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content70% 
IMG OID637881043 
Productserine hydroxymethyltransferase 
Protein accessionYP_482798 
Protein GI86742398 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.982412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.848378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC CGTTCTGGGG CCCGGACTTC GACCAGCTGA GCGCGACGGA TCCGCAGATC 
GCGGAGGTGG TCCTCGATGA GCTGGACCGG CTGCGCGGCG GCCTGCAACT CATCGCGAGC
GAGAACTTCA CGTCCCCGGC GGTGCTGGCG GCGCTGGGCT CGACGTTGTC GAACAAGTAT
GCCGAAGGGT ATCCGGGTCG GCGTTACTAC GGCGGTTGCC AGGTGGTCGA CCGGGCCGAG
GAGATCGGCA TCGCCCGGGC GAAGCAGCTC TTCGGCGCGG AGCACGCCAA CCTGCAACCG
CATTCGGGCT CGTCGGCGAA CTTCGCCGTG TACGCGGCGC TACTCACGCC AGGTGACACG
GTCCTGGCGA TGTCGTTGCC GCATGGCGGT CACCTCACCC ACGGCAGCAA GGTGAGCTTC
TCCGGTAAGT GGTTCAACGT GGTGGCCTAC GGCGTGCGGG AGGACACCGA GCTGATCGAC
TACGACCAGG TGCGGGAGCT CGCCCGCCAG CACCGGCCCA AGATGATCAT CTGTGGGGCG
ACGGCCTACC CACGTCTGAT CGACTTCGCC GCGTTCCGCT CGATCGCCGA CGAGGTCGGT
TCGTGGCTGA TGGTGGACGC GGCGCACTTC ATCGGTCTGG TCGCCGGGGG CGCGATCCCG
AGCCCCGTTC CCTACGCCGA CGTTGTCAGC TTCACCACCC ACAAGGTGTT GCGCGGCCCG
CGAGGGGGCA TGATCCTCGC GCGTGAGGAG CTGGCTTCCC GCATCGACAA GGCCGTGTTC
CCGTTCAGCC AGGGTGGCCC GCTGATGCAC GCGGTCGCGG CGAAGGCCGT CGCGTTGCGG
GAGGCGGCCT CGCCCGCTTA CGCGCAGTAC GCTCGCCAGG TGGTGGCCAA CGCGCAGCGG
CTCGCCGACG AGCTTGCCGC CGAGGGCATC CGGCCCGTCG CCGGTGGCAC CGACACCCAT
CTCGCCCTGC TCGACCTGCG GGAACTCGGG GTCAGCGGCA AGGAAGCCGA GGCGCGTTGC
GACGCGGCCG GCATCACCCT GAACAAGAAC GCCATTCCCT ATGACCCGCA GCCGCCCGCG
ATCTCCTCCG GCATCCGGGT GGGAACCCCG GCGGTCACCA CCCAGGGGAT GGGCGAGGGG
GAGATGAAGG AGATCGCGGG GCTGATCGCC CACGCGGTGC GTGAGCCGGA AGCCGCCGCC
GACGTCGCTG CGGCGGTGTC CGCGCTCGTC GCCCGGCATC CGGCCTATCC GCGGTAG
 
Protein sequence
MSTPFWGPDF DQLSATDPQI AEVVLDELDR LRGGLQLIAS ENFTSPAVLA ALGSTLSNKY 
AEGYPGRRYY GGCQVVDRAE EIGIARAKQL FGAEHANLQP HSGSSANFAV YAALLTPGDT
VLAMSLPHGG HLTHGSKVSF SGKWFNVVAY GVREDTELID YDQVRELARQ HRPKMIICGA
TAYPRLIDFA AFRSIADEVG SWLMVDAAHF IGLVAGGAIP SPVPYADVVS FTTHKVLRGP
RGGMILAREE LASRIDKAVF PFSQGGPLMH AVAAKAVALR EAASPAYAQY ARQVVANAQR
LADELAAEGI RPVAGGTDTH LALLDLRELG VSGKEAEARC DAAGITLNKN AIPYDPQPPA
ISSGIRVGTP AVTTQGMGEG EMKEIAGLIA HAVREPEAAA DVAAAVSALV ARHPAYPR