Gene Franean1_3857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3857 
Symbol 
ID5672220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4584655 
End bp4585926 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content77% 
IMG OID641242735 
Productglycosyl transferase group 1 
Protein accessionYP_001508155 
Protein GI158315647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.352881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGTG TGCTGGTGGT GGCGGACAAG TTCCCGCCGA CGATCGGCGG GATCCAGACG 
TTCGCCTGCC GGCTGACGGC CGGCCTACCC CCGGACCGGG CCGTCGTCCT CGCCCCGGCC
CAGCCCGGTG ACGCCGAGTT CGACCGCACC CTCGGCTTTC CGGTGATCCG CACCGAGCAC
GGCATGATCA CCTCGCCGCG TGGCCGGCGG GAGCTGCGGG CCGCCGTCCG GGCGCACGGG
TGCGAGGTGG CGTGGTTCCC CACCGCTGCC CCGCTCGGCG TGCTCGCGCC GGTGCTGCGC
GAGGCCGGCG TCGAGCGGGT GGTGGCCTCC AGCCACGGGC ACGAGGTCGC CTGGTCCCGG
CTGCCGTTCG GACGGCTGCT GGTCTCCACC GTCGGCGCGC GGGTCGACGT GCTGACCTAC
CTCACCGAGT TCACCCGCCG CCGGCTCGCG GCGGTCACCC CGCCGGGGAC CGAGCTGGCC
CGGCTCACCG GCGGCGTCGA CACCGAGCGG TTCCAGCCGG GCACCGGCGG GGACGAGATC
CGCCGGGGCC TGGGCTGGTC GGACGAGCCG GTCGTGATCT GCGTGGCCCG CCTCGTCACC
CGTAAGGGCC AGGACACGCT CATCCGGGGC TGGCACGACG TCCGACGCCG GCACCCGCAC
GCGCGGCTGC TGCTCGTGGG CGGCGGGCCC GCCGAGGACC GCCTGCGCCG CTTGGCCGCG
CGGGCCGGCG TCTCCGACGG TGTGCACTTC GCCGGTCCCG TGCCCGACGA GCTCCTCCCC
GCCTACCTCG ACGCGGCGGA CGTCTTCGCG ATGCCGTCAC GCACCCGGCT GTGCGGGCTC
GACCTGGAGG GCCTCGGGCT CTCCGCGCTC GAGGGGGCGG CCAGCGGCCT GCCGGTGATC
ACCGGCGCCC AGGGCGGCGC ACCGGACGTC GTCATCCCCG GCCGCACCGG CGTGGCCGTC
AACGGGCACG ACCGCACGGC CGTGGCCGCC GCCGTCATCG ACCTGCTCGA CGACCCGCGG
CAGGCGGAGC GCATGGGCGC GGCCGGCCGC GCGTGGATGC GGGCGGCGTG GAGCTGGGAG
ACGCTCAGCC TGCGCCTCGC CGGCATCCTC AGCGGGCAGG CCCCGACCGC CATCGGCGCC
GGCGCCGATG CGGCATGGAC CGCGGCAGAT GCCGAGGATG GGGCAGATGC CGTGGCCGGG
GCCCGGGCGG CTGTCGCGGT GGGCGGCGCC CGGGTGGGCA TGACGGTGCC GGAGCGCGGC
GATGGTCGTT GA
 
Protein sequence
MPRVLVVADK FPPTIGGIQT FACRLTAGLP PDRAVVLAPA QPGDAEFDRT LGFPVIRTEH 
GMITSPRGRR ELRAAVRAHG CEVAWFPTAA PLGVLAPVLR EAGVERVVAS SHGHEVAWSR
LPFGRLLVST VGARVDVLTY LTEFTRRRLA AVTPPGTELA RLTGGVDTER FQPGTGGDEI
RRGLGWSDEP VVICVARLVT RKGQDTLIRG WHDVRRRHPH ARLLLVGGGP AEDRLRRLAA
RAGVSDGVHF AGPVPDELLP AYLDAADVFA MPSRTRLCGL DLEGLGLSAL EGAASGLPVI
TGAQGGAPDV VIPGRTGVAV NGHDRTAVAA AVIDLLDDPR QAERMGAAGR AWMRAAWSWE
TLSLRLAGIL SGQAPTAIGA GADAAWTAAD AEDGADAVAG ARAAVAVGGA RVGMTVPERG
DGR