Gene Franean1_5302 details


Gene Information

Locus tag: Franean1_5302
Symbol:
ID: 5673636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6382350
End bp: 6383492
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 74%
IMG OID: 641244159
Product: glycosyl transferase family protein
Protein accession: YP_001509566
Protein GI: 158317058
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
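
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 6382350
end_bp = 6383492
gene_length_bp = 1143
protein_length_aa = 380

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. Protein Length
```

Under these assumptions all three checks agree with the record: 1143 bp is 381 codons, of which 380 encode amino acids.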


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.641383
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTTG TGCCTGTGAG CGTGGTCATT CCCGCTTACA ACGAGGCACT TCGTCTTCCG 
GCGTCCCTGC CGCGGCTGCT GGCTGTCGTG GGCAAGATCC CCAGGGCTGA GGTGATCGTC
GTCGACGACG GCAGCACCGA TGGCACCGCC GGGGTCGCCG AGGACCTGCT CGAGGGCTTT
CCGAACCACC GTGTGGTACG CCTGCCGTGG AACTGCGGAA AGGGCACCGC GGTAAGGGCG
GGCGTGTCGG CCGCGCATGG CCGGTCGATC GTCTTCATGG ACGCCGACGG GGCCTCCGAC
GTGAACGACC TGCCGTTGCT GCTCGCCGCG CTCGAGCACG CCGAGGTGGC GCTGGGCTCG
CGGCGAATCG GCGACGGAGC CACCCGGACA AGCGGCCGCA GGGCCGGTAG CTGGGCTTTC
AATCAGATTA CGCGTTCACT CGCGGCGCTG GACGTCGCTG ACACGCAGTG TGGCTTCAAG
GCGTTTCGGC ACGCGGAAGC CAAGATTCTT TTCAGTCTCG CGCGCTCCAC CGGCTTCGGA
TTCGACGTCG AGGTGCTCTC GATCGCGCGC TCGGTTGGCT ATCGCATCGC CGAGGTACCC
GTGCGCTGGG AGGAGACGCC CGGCGGCACC TTCCGGATCA CCCGGCACAC CCCCGCGATG
CTCGTCGACG TCGTCCGGGC CCGCCGCTAT CTCAGCCGGG TCGGGCTCCC GCCGGTCAGC
CGTCGCCAGC GGCTGGGCGA GCTCGGTGTC GTGGACGCGT CCGAGCTGCT CGGCCGGCCG
GCCACGCCGC GCGGTGCGGG GGAACCGCAG GGTGCCCCGC CCGGCCAGCT GCCGGTGCCC
GCGGCCCCCA CCCGGCCCGG CACACCGGCC CGGCCGACGG CCCGTACCCG GCCCGGCACA
CCGGCCCGGC CGGTGCCCGC CGCCCGGCCC ACGACCGCCG CCGCTCCCGC CCGGCCCGCG
GCCGTTCCCG CACTGCCCAT GACTCCCGCG CTGCCCGCCG CCCCCGCGCG GCCCCCCGCG
GTGCCCCGCC CCACGCCGCG TCCCGCTCCC GCGGTTCCAG CCGTGAGCGT GGCGCACGGG
ACCGGGCGGT TCACCCCGTC GCCCTCCCGC GGTGAGATCC CCGGTCCCGC GCCCGCACCG
TGA
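
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a sequence printed in this layout of space-separated 10-base blocks. This is a minimal sketch; the helper name `gc_percent` is hypothetical, and the way the database rounds its reported value is not known from the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first row of the gene sequence above.
# Pasting the full sequence works the same way, line breaks included.
first_row = "ATGGACCTTG TGCCTGTGAG CGTGGTCATT CCCGCTTACA ACGAGGCACT TCGTCTTCCG"
print(round(gc_percent(first_row), 1))
```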
 
Protein sequence
MDLVPVSVVI PAYNEALRLP ASLPRLLAVV GKIPRAEVIV VDDGSTDGTA GVAEDLLEGF 
PNHRVVRLPW NCGKGTAVRA GVSAAHGRSI VFMDADGASD VNDLPLLLAA LEHAEVALGS
RRIGDGATRT SGRRAGSWAF NQITRSLAAL DVADTQCGFK AFRHAEAKIL FSLARSTGFG
FDVEVLSIAR SVGYRIAEVP VRWEETPGGT FRITRHTPAM LVDVVRARRY LSRVGLPPVS
RRQRLGELGV VDASELLGRP ATPRGAGEPQ GAPPGQLPVP AAPTRPGTPA RPTARTRPGT
PARPVPAARP TTAAAPARPA AVPALPMTPA LPAAPARPPA VPRPTPRPAP AVPAVSVAHG
TGRFTPSPSR GEIPGPAPAP
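
The protein sequence corresponds to the gene sequence translated with the translation table given in the record (table 11). The sketch below is one way to reproduce that mapping, not the database's own method. It builds the codon table from the NCBI amino-acid string for table 11, whose codon assignments match the standard code. The function name `translate` is hypothetical, and the sketch does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

# NCBI ordering: the first base varies slowest, each base cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGACCTTG TGCCTGTGAG CGTGGTCATT"))  # prints MDLVPVSVVI
```

The printed residues match the first block of the protein sequence above.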