Gene Franean1_6554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6554 
Symbol 
ID5674869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7971427 
End bp7972947 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content69% 
IMG OID641245403 
ProductO-antigen polymerase 
Protein accessionYP_001510797 
Protein GI158318289 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.408434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG ATTGCCTCGC CGATCCGGTC ACCCCGGGTA CCCGGCCGTG GGGCGGCGAG 
GCGCCCCGTT CTCGCCGCGG CGACATACTC GCCTCTCCGG TCAGCCAGAC GTTTGCCGCG
GTGGTGCTCG GCACCGCCGC GGTCGCCGCG GCGGTGCTGC GTGGCCCGGT CGGCGCCGCG
GCAGTGCTGG CCGTGCCCGC ACTGGTAGTA CCGCTGCTGC TCAGCCACCG GGACGCGGTG
AGCCTACTGA CGTTGTTCGT CTTCGCCCTC TTCTCGGTGC CGGTCTCCTA CCGGCTCGGG
CCCGCCGGTG CGCTCGCCGT CCCGATCGGC GTTGGTTGCC TGGCGTGCTG GCTCCGGCAT
CGAGCCGCAC CGCGTACCCG CCTCGACCCG GCGTTCCAAC CGGTGCGGGC GGCGACGCTC
GTGCTTCTGT GGTTGTTCAC TCTCAGCTTC GCGGTGGCGT TCACCCGGAT CACCACACCG
CTGGAGATCC GCTCGGCTGA GCGTTACCTC GTCATCGCCA CCGCGCAGAG CGGGGTGACG
CTGGTGGCCG CAGACATGAT CGGGAACCGG GCCAGGCTCG ACACCATCCT GCGCCGGATC
GTGCTGGGCG CGACCTTCAT GGCCGTCATC GGTGGAATCC AGTTTCTGAC GGATCGTGAC
TACAGCAGCA TCCTGGTGCC CCCGGGCATG TCGGTCGGCC CTTCGCCCCG CGAGGTGATC
GAGCTGCGTT CCGACTTCCG GCGGGTCGCC GGTACCGCCG GCCACCCGAT CGAGTTCGGG
GTCGTTCTGG CCATGATCCT GCCGTTGGCG CTGCACTACG CGTGCGTTAC TCGCGGGCAA
GCGGCGCGGG TCTGGGCGTG GGCGCAAGTC GCTGTGATCG GCGCCGCCAT CCCGACCAGC
ATCTCCCGAA GCGCGGTCCT GAGCTTTGCG ATCGGGATCA CGGCTTTCCT CACCGTCCGG
GGCTACCGGC GGATCCTGCA CGGCCTGCTG GCGCTGGCCC TGTTCCTGTT CATCTTGCAC
GAGATGTTTC CGGGCCTGCT GGAGGAAATC GTGTCGCTGT TCCTCGGCGC GAATAAGGAC
CCGAGTGTCG CCGGCCGCAC GGAAGACTAC GCGGCGGTGT GGGAACTGAT CCTGCGACGG
CCGTTTCTCG GCCTGGGAAT CGGCACGTTC ATCCCGCAGC AGTACTTCTT CCTCGACAAC
CAACTCCTCG GTTCAGTGCT GGAGACCGGC GTCCTCGGTA CCGTGGTCCT ACTGGGCTGG
CTCGCCGTTG GTCTGTCCGT GAGCCGCGGA GTTCGGCGCC GGGCGCGCAC AGCGCGCGAC
CGGGAACTCG GCCAGACACT GGTGGCCTCG ATTCTGGCGG GTTTCGCCGG CTTCCTCACC
TTCGACGCGC TCGGTTTCGC GATCTTCAGC GGCCTGCTGT TCCTACTGGT CGGATGCGCG
GGAGCGCTCT GGCGTATGAC GGCATCACCC GAAACACTCA CGCCATCGCC GCAGGCGCTG
GCCGCGACGG GCCGGTCATG A
 
Protein sequence
MTADCLADPV TPGTRPWGGE APRSRRGDIL ASPVSQTFAA VVLGTAAVAA AVLRGPVGAA 
AVLAVPALVV PLLLSHRDAV SLLTLFVFAL FSVPVSYRLG PAGALAVPIG VGCLACWLRH
RAAPRTRLDP AFQPVRAATL VLLWLFTLSF AVAFTRITTP LEIRSAERYL VIATAQSGVT
LVAADMIGNR ARLDTILRRI VLGATFMAVI GGIQFLTDRD YSSILVPPGM SVGPSPREVI
ELRSDFRRVA GTAGHPIEFG VVLAMILPLA LHYACVTRGQ AARVWAWAQV AVIGAAIPTS
ISRSAVLSFA IGITAFLTVR GYRRILHGLL ALALFLFILH EMFPGLLEEI VSLFLGANKD
PSVAGRTEDY AAVWELILRR PFLGLGIGTF IPQQYFFLDN QLLGSVLETG VLGTVVLLGW
LAVGLSVSRG VRRRARTARD RELGQTLVAS ILAGFAGFLT FDALGFAIFS GLLFLLVGCA
GALWRMTASP ETLTPSPQAL AATGRS