Gene Franean1_5893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5893 
Symbol 
ID5674215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7156132 
End bp7157691 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content74% 
IMG OID641244742 
Productglycosyl transferase group 1 
Protein accessionYP_001510144 
Protein GI158317636 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.566479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.440344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGC CCGGGCTGCG GGCGGCGGTC TACAACCGCT TCTGGCACTC GATGGGCGGC 
GGCGAGCGCC ACAACGGCAT GATCGCGCAG GTGCTGGCCG CGGACGGGCT CGACGTCGAC
CTGATCGGGC ATTCCGACGT CGACCTCGAC GCGCTCGGCA GCCACCTGGG CCTGGATCTC
TCCGGCTGCC GGTACCGGCG GCTGCCGGAC CGTGGCGAGG ACGCGATCGC CGCGCTGTCC
GAGTCGTACC ACCTGTTCGT GAACGGCTCG TACATGAGCC GGCTCGCCCC GCGCTCGCCG
CGGTCGGCCT ACCTGTGCTT CTTCCCGACG CCGTTCGACC ACGACATGGC CGCCTGGCGC
AAGGCCGCGG TGCGCACCGC CGGCCCGCTG CTGCGCGGGG TGAGCCCGGC CGTCAGCTTC
GGGCAGGGCT GGTACCCGCC CGAGGGCGGC CGGCGCCGGC AGTGGACGTG GACGAACGGC
GAGGGCATCC TGGCCGTCAA CCCGGGCGGC GGGCGTACCC TGCGCGCGGA CATCGGGCGT
CCCGGGGCGA GCGGTGGCGT GCGGCTGCGG GTCGTCGACG CGGACGGCAC CGTCCTCGCC
AAGCTCCAGA TCGAGCAGGC GTTCGCGCCG TTCGAGGTCA CCCTGCCGTC GTCGACCGCC
GGGACGGAGC TCACCTTCGT CAGCGACGTC TTCTCCCCCG GGCCGGCGGA CGTCCGCGAG
CTCGGTGTGG CCGTCAGCCG GCCGCGGGTC ACCGACGCCG ACGAGGGGCC GCTGGAGCGG
ATGGCGCTGC GTTTCCCCTG GCTGCTGCGC GACCCGGCCG ACCTCGGCTA CCTCGAGGGC
TACGACACCG TGATGGCCAA CTCCGAGTAC ACCCGGGGCT GGATCCGGCG GATGTGGCGG
CGGGACTCCG ACGTGCTGTT CCCGCCGATC CAGGTCGACC GGCTCACCCC GGCGCCGGAG
CGCGAGAAGG CCGTCATCAC CGTCGGCCGG TTCTTCGCCC CCGGCCTCGG CCATGCCAAG
CGGCAGCTCG AGATGGTGCA GTGGTTCGCC GAGCTGTACC GCTCCGGTGC GCTGCCCGAC
TGGCGGATGT ACGTCGTCGG CGGATGCGAG GACTCGCAGA AGCCGTACGT CGAGCAGGTC
CGCGCCGCCG GGGCGGGGGT GCCCGTCGAG GTCCTGCCGA ACGCCCCGCG CACCGAGGTG
GAGCGGCTGC TGTCGACCAG CTCGGTCTTC TGGTCCGCGA CCGGCTACGG CGAGGACGAC
CGCAAACGCC CCTGGACGGC GGAGCACTTC GGGATGACAA CCGTCGAGGC GATGGCCGGC
GGCTGCGTGC CCGTGGTCAT CGACCGTGCC GGGCAGCGCG AGATCGTCCG GCACGGGGTG
GACGGCTACC GCTGGACCGG CCCGGAGCAG GTCGCCTCGT TCACCCGCCG GCTCGCCGCC
GAGGACGGGC TGCGCTCCCG GCTGGCCGCC GCCTCCGTGC AGCGGGCCCA GCAGTTCTCC
GACGCCGCGT TCGCCGAGCG GTGGCACGGC ATCGCCGAGC AGCGCCGGCT CTACTCCTGA
 
Protein sequence
MTTPGLRAAV YNRFWHSMGG GERHNGMIAQ VLAADGLDVD LIGHSDVDLD ALGSHLGLDL 
SGCRYRRLPD RGEDAIAALS ESYHLFVNGS YMSRLAPRSP RSAYLCFFPT PFDHDMAAWR
KAAVRTAGPL LRGVSPAVSF GQGWYPPEGG RRRQWTWTNG EGILAVNPGG GRTLRADIGR
PGASGGVRLR VVDADGTVLA KLQIEQAFAP FEVTLPSSTA GTELTFVSDV FSPGPADVRE
LGVAVSRPRV TDADEGPLER MALRFPWLLR DPADLGYLEG YDTVMANSEY TRGWIRRMWR
RDSDVLFPPI QVDRLTPAPE REKAVITVGR FFAPGLGHAK RQLEMVQWFA ELYRSGALPD
WRMYVVGGCE DSQKPYVEQV RAAGAGVPVE VLPNAPRTEV ERLLSTSSVF WSATGYGEDD
RKRPWTAEHF GMTTVEAMAG GCVPVVIDRA GQREIVRHGV DGYRWTGPEQ VASFTRRLAA
EDGLRSRLAA ASVQRAQQFS DAAFAERWHG IAEQRRLYS