Gene Franean1_1673 details

Gene Information

Locus tag: Franean1_1673
Symbol:
ID: 5670075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2001345
End bp: 2002580
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 69%
IMG OID: 641240591
Product: glycosyl transferase group 1
Protein accession: YP_001506017
Protein GI: 158313509
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
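
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions consistent with the listed values, not something the record states. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 2001345
END_BP = 2002580


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, stop_codons: int = 1) -> int:
    """Residue count, assuming the CDS is a whole number of codons
    and ends with `stop_codons` untranslated stop codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.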


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.272276
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATCG CTTTCCTATG CGAGCAGTAC CCACCGATCA TCTGGGATGG CGCTGGCGTC 
TACACGCACG ACATCGCTCA CGCACTGGTT CGGCGTGGCC ATGAGGTTCA TGTGCTGTGC
ACCCAGGGCC GCCGTATCCG CGACGACGTG TTCGACGGCG TTCACGTTCA CCGCCGCCCG
CTGTTGCGTG CGCCTGTCAC TCGATACCTT GGGCCGGCAG CAAAGCTGAT CAACGGCCGG
GATCATCCGC GTGACTCCTT GTCGCTGCGG GCCTCACTGG CGGTGTCGTA CGGCTTCTGG
CTTCGGCAGA GCGGGATCAA CCCGGACGTC ATCGAGACAC AGGACGGCGA GACACGGGGC
CTGTTCACGG CTCTCGGTCG GCGTACGCCG CTGGTGATCC ACCTGCACAC GCCGACGATG
ATGGACGTCC GTCTGCGGGA TCCCGAACTG AGCCGCAAGG GGGAGCTGGC CGACCGCATC
GACCGCTTCT CGGCGCTGCG AGCGGACGCG CGGACTTCCC CGTCCCAGCT CCTCGTCGAC
ACGCTGCACG AGCTCGACTG GCTGCGACCG GACACCGACG TCGACGTCAT CCCGTACCCG
TTCGACAACG TGCCGTTCGC GAGCGTCCCC ACGGCAGAGC ACACCGGACC GAACCTGGTC
GTGGTCGGCC GGCTGGAGTG GCGCAAAGGG CTGGACGTGC TCGTCGAGGC CGCGTCCCGG
CTACTGGCGC GGGGAGTCGA GGCCAAGCTG ATCTTCGTCG GGCAGTCGTC CGGTCGGATC
GAAGGCGTCG AGACCGGGGC ATGGCTCGAG CGGAAGGCCG CTGAGCTGGG CGTTCCGGTC
CGGTTCGAGG GGCATGTCTC CCGCACGGAG CTTCCGGCGC TCTACGGTGA GGGCCGGGCG
GTCGTTGTGC CGAGCCGGTT CGAGAGCTTC TCCATCGCGG GCCTTGAGGG GATGGCCGCC
GCTCGCCCGG TGGTCGCCAC AGCGACGACC GGCGTCTCGA CCTGGGTCGA CCGCTGGAAG
GGCGGCGCCG TCGTGCCGCC GGAGGACCCG GAGGCGATGG CGGACGCTCT CGAGCCGTTC
CTGACCGACC AGGACCACGC GGCCGTCGTC GGCCTGCGTG GTCGGATGGG CACCGCTGAG
CTGGATCCGG CGCGCATCGC CGAGCGCCGC GAGGAGGTCT ACCTCAAGGC GATCGCGCGT
CATGAGGTCC GCCGGCCCGA GAGGCAGCGG GGATAG
 
Protein sequence
MRIAFLCEQY PPIIWDGAGV YTHDIAHALV RRGHEVHVLC TQGRRIRDDV FDGVHVHRRP 
LLRAPVTRYL GPAAKLINGR DHPRDSLSLR ASLAVSYGFW LRQSGINPDV IETQDGETRG
LFTALGRRTP LVIHLHTPTM MDVRLRDPEL SRKGELADRI DRFSALRADA RTSPSQLLVD
TLHELDWLRP DTDVDVIPYP FDNVPFASVP TAEHTGPNLV VVGRLEWRKG LDVLVEAASR
LLARGVEAKL IFVGQSSGRI EGVETGAWLE RKAAELGVPV RFEGHVSRTE LPALYGEGRA
VVVPSRFESF SIAGLEGMAA ARPVVATATT GVSTWVDRWK GGAVVPPEDP EAMADALEPF
LTDQDHAAVV GLRGRMGTAE LDPARIAERR EEVYLKAIAR HEVRRPERQR G
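
The protein sequence can be re-derived from the gene sequence by codon translation. The sketch below is one minimal way to do that in plain Python, not the method used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the input is assumed to be the coding strand in frame, written in 10-base blocks as above.

```python
# Codon table built in NCBI order (bases T, C, A, G).
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares; only start-codon usage differs.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCGGATCG CTTTCCTATG CGAGCAGTAC"))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.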