Gene Franean1_5879 details


Gene Information

Locus tag: Franean1_5879
Symbol:
ID: 5674202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7136630
End bp: 7137817
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 75%
IMG OID: 641244729
Product: glycosyl transferase group 1
Protein accession: YP_001510131
Protein GI: 158317623
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
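
The length fields above agree with the coordinates if the start and end positions are 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. A minimal Python sketch of that arithmetic, under those assumptions (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(7136630, 7137817)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```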


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.734858
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGGGTCG GTTTCCTGGT CGAGCAGATT CTGGCGCCGA TCCCCGGGGG GACCGGAAGG 
TACTCCGCGG AGCTCGCCGC CGCCCTGGCC CGGACGGCGG ATCCCGGGGA CGGCGTCATC
GGGTGGTGTG CCTTCCGGCG TGACACCTCC GCGGCGGCCC TGCCGGGCGT CCTCGGGCCG
CTGCGCCTGG GCCTGCCGCG GCGGGCGCTC GCCGCCGCCT GGGCGCGCGG CGCGGGCCCG
GGCCCGCCCG GGGTGGACGT CGTCCACGCG CCGACGCTGC TCGTGCCCCC ACCCGTCCGG
CGGGACCGCG TCACCCGTGC CGCCCGTGCC CGGCCGCGGC TGGTGGTCAC CGTGCACGAC
GCCGTCCCGT GGACCCATCC CGGGACCCTG ACGCCGCACG GGGTCCGCTG GCACCGGGAG
ATGGGGGAGC GGGTCGCCCG CCACGCCGAC GCGGTGATCG TGCCGACCCG GGCCGTCGCG
GCTGACATCC GGGAGCATCT GCCGATTGAC GCGCAGCGGC TGCACGTGAT CGGTGAGGGG
GTCGCCGAGG CGGTGCTGCG GGTGCCGCCG GACGCCGACC GCCGGGCCGC CCGGCTGGGC
CTTCCCGAAC GGGGGTATTT GCTCACCCTG GCGACGATGG AGCCGCGCAA GGGCCTGGAC
ACCCTGCTTG CCGCCCTGCG CCATCCGGAC GCCCCCGACC TTCCGCTGGT GCACGTCGGG
GCTGCCGGAT GGGGTGATCT CGGCCCCGCC GCGACCGGGC CGGGAGGCTT CGCCGATCTT
GCCGCCTCCG GGCGGTTTCT TGGTCTCGGC CGGATCAGTG ACGAAGATCT CGCCGTCGTG
CTCTCGCGGG CGACGGTGCT GGTCGCGCCC AGCCGTTCCG AGGGCTTCGG CCTGCCGGTC
ATCGAGGCGA TGGCACACGG GGTGCCGGTG GTCGTCTCCG ACGCGCCGGC TCTTGTCGAG
GTCGCCGGCG ACGCGGCACT GGTGGCCCGG ATCGGTGATC CGGCCGGCTT CGCCGAGGCG
CTCGCGCGTA TCGTCCAGAA TCCCCGGCTA CACAGTCGTT TGTCGCGTTC CGGACGGGTC
CGCGCGGCGG GATATACCTG GAATGGGGCG GCGCGATCGT GCTGGGAGCT CTACCGTCGA
ATCAGCGGCT CCCCTGCCGT GTCCGTCGCC GACGACGGGC GGGCGTAG
 
Protein sequence
MRVGFLVEQI LAPIPGGTGR YSAELAAALA RTADPGDGVI GWCAFRRDTS AAALPGVLGP 
LRLGLPRRAL AAAWARGAGP GPPGVDVVHA PTLLVPPPVR RDRVTRAARA RPRLVVTVHD
AVPWTHPGTL TPHGVRWHRE MGERVARHAD AVIVPTRAVA ADIREHLPID AQRLHVIGEG
VAEAVLRVPP DADRRAARLG LPERGYLLTL ATMEPRKGLD TLLAALRHPD APDLPLVHVG
AAGWGDLGPA ATGPGGFADL AASGRFLGLG RISDEDLAVV LSRATVLVAP SRSEGFGLPV
IEAMAHGVPV VVSDAPALVE VAGDAALVAR IGDPAGFAEA LARIVQNPRL HSRLSRSGRV
RAAGYTWNGA ARSCWELYRR ISGSPAVSVA DDGRA
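
The two sequence blocks can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python. It assumes the sequences are pasted as text with the 10-character spacing shown above, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns amino acids to codons in the same way as the standard code, so the standard amino-acid string is used here. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter for this gene, which starts with ATG.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order, paired with the
# amino-acid string shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGGGTCG GTTTCCTGGT CGAGCAGATT"
print(translate(first_blocks))  # MRVGFLVEQI
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.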