Gene Franean1_6543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6543 
Symbol 
ID5674858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7958399 
End bp7959388 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content66% 
IMG OID641245392 
Productglycosyl transferase family protein 
Protein accessionYP_001510786 
Protein GI158318278 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.118819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCG ACGTCTCAGT CCTGATCGTC TCATACAACA CCGGTGAAAT GACGGCGACG 
TGCCTGGAGT CCCTCGAAGC CACGTCCGGC GGCCTGCGCA TCGAGGTCAT AGTCGTGGAC
AATGCCTCGA CGGATGGATC CGCCGAAATC GTCCGTAAAC GTTTTCCCTC GGTCAAGCTG
ATCGAGCTGA GTTCGAACAT CGGTTTCGGC CGAGCGGTCA ATCTCGCGGC ATCTAATGCC
CTGGGAAACT ATCTGCTCCT GCTCAATCCC GACGCCGTCG TGCTCACTGG CGCGGTTCAG
AATCTGTTGG AGTTCGCCCG GGCCAACCCA CGGCACGGCA TCTACGGCGG CCGCACCTTC
GACCCGCAGG GCGCCGCCAG CCACACCTCG TGCTTTGGTG CGCCCACCGT CTGGAGTCAC
GTCTGCTTCG GCATGGGCCT GTCTACCGTC TTCCGGCGCT CACGTGTCTT CGATCCGGAA
TCGCTGGGAC GGTGGGAGCG TGACAGCGTC CGGACTGTGG GCGTGGTGAC CGGCTGCCTG
CTGCTCGTTC GGCGGGCACT GTTCGAGCAG TTGGGAGGCT TCGACCCCCG CTTCTTCATG
TACGGCGAAG ATGTCGACCT GTCGGTGCGC GCCCGCCGAG CCGGCTGGGA TCCGGTGATC
ACGCCCGACG CGGTCGTGAT TCACCACGGT GGCGCGTCGT CGTCCAACTG GACGGGCAAG
CATGTCCTGG TGATGAAGGG GAAGACGACG CTCGCACGGG TGCACTGGAC CGGATGGCGT
AGCGGGCTGT GCCTGACGAT GCTGTGGCTC GGGGTGACGC TACGGGCGAT GCCCACGGTG
GCATCCGGTG GCCGGTCGGC GGGTAGTGGG ACCAGCGACT GGCGTGGTCT GTGGCACCGC
AGAGCCGACT GGTGGTCCGG GTACGAGCAG GTCGCACCCG AGCCGGCCGA GACCGGACGC
GAAGAAAGCG CGGCCCGGCC ACCGGTGTGA
 
Protein sequence
MAVDVSVLIV SYNTGEMTAT CLESLEATSG GLRIEVIVVD NASTDGSAEI VRKRFPSVKL 
IELSSNIGFG RAVNLAASNA LGNYLLLLNP DAVVLTGAVQ NLLEFARANP RHGIYGGRTF
DPQGAASHTS CFGAPTVWSH VCFGMGLSTV FRRSRVFDPE SLGRWERDSV RTVGVVTGCL
LLVRRALFEQ LGGFDPRFFM YGEDVDLSVR ARRAGWDPVI TPDAVVIHHG GASSSNWTGK
HVLVMKGKTT LARVHWTGWR SGLCLTMLWL GVTLRAMPTV ASGGRSAGSG TSDWRGLWHR
RADWWSGYEQ VAPEPAETGR EESAARPPV