Gene Franean1_5103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5103 
Symbol 
ID5673438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6108859 
End bp6110400 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content77% 
IMG OID641243954 
ProductUDP-N-acetylmuramyl-tripeptide synthetase 
Protein accessionYP_001509368 
Protein GI158316860 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0769] UDP-N-acetylmuramyl tripeptide synthase 
TIGRFAM ID[TIGR01085] UDP-N-acetylmuramyl-tripeptide synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00121177 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0610764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCC CGGCACTACG TCCGGCAGCG CCGTCACCGC GGTCGCTCAC CGCTCTGCCC 
GACGTCCTCA GATCCATCAC CGTCGCGCCC GGTCCGTCGG GCGGGGCGGC CCTCGCCGGT
GTCGTCCTCA CCGGCGTCAC CCACGACTCG CGCGCGGTGC TGCCGGGCGA CCTCTACGCG
GCGCTGCCCG GCGCGCACGT GCACGGTGCC GACTTCGCGG CCGGCGCGGT CGCCGCCGGC
GCCGTCGCCG TCCTCACCGA CCCGGTGGGC GCCGCGGCGC TGTCCGGCCG GCTCGCGGTG
CCCGTGCTCG TCACCGACGA CCCGCGCCGC GACCTGGCGC CGGCCGCCGC CTGGATCTAC
GGCGACCCGT CGAGCGCGCT GACCCTGCTC GGGGTCACCG GCACGAACGG CAAGACCACC
ACCGCCTTCC TCCTGGACGC CGGCCTGCGC GCCGCCGGGT ACACCACGGG CCTGCTCGGC
ACGGTGCAGA CCCGGGTCGC CGGAACGGTG GTGCCCTCGG TGCGCACGAC GCCCGAGTCC
GGCGACCTGC AGGCGCTGCT CGCGACGATG GTGGAGCGGG GCGTCGGGGC GGCGTCGATG
GAGGTGTCGA GCCACGCGCT GGCCCAGCAC CGGGTGGACG CGCTGCGCTT CGCCGGCGCC
GCCTTCACCA ACCTCAGCCA GGACCACCTC GACTTCCACC CGACGATGGA GGACTACTTC
GCGGCGAAGG CGGCACTGTT CGAGCCCGCC CGCAGCGCGG CCGCCGTGGT CTGCGTCGAC
GACGACTGGG GCCGCCGGCT CGCGGCGCTG CGCCCGGACG CGCGCACCTA CTCGGTGACC
GGCCGCCCGG CCGACTGGTG GCCGGAGGAC GTCGTCGCCG GGCCGGGCGG CAGCGTGTTC
CGCGCGCTGG GCCCGGACGG CGCGAAGGCG GACCTCTCGC TCGCCCTGCC GGGCCGGTTC
AACGTCGCCA ACGCGCTGGG CGCCCTCGCG CTGCTGGCCG CTGTCGGTGT TCCGCTGGAG
GCGGCGGCCG AGGGCATCTC GTCGCTGCCG GGGGTCCCGG GCCGGATGGA GCGGATCGAC
GCCGGCCAGC CGTTCCTCGC CCTCGTCGAC TACGCCCACA CCCCCGGCGC GGTCGAGACG
CTGCTCGCCA CCGTGCGCCC GATCGTGACC GGGCGGGTGG TCGTCGTCCT CGGCTGCGGT
GGGGACCGCG ACCGCGCCAA ACGGCCGCTG ATGGGCGCGG CCGCGGCGCG GCTGGCCGAC
CTGGCGGTGT TCACCAGTGA CAACCCCCGG TCCGAGGACC CGGCGCGCAT CCTCGAGCAG
ATGCTGGCCG GTGCCCGCGG CGCCCCCGGA GCGGGCGAGA TCGTTGTGGA GCCGGACCGG
GCGAGCGCGA TCGCGCTCGC GGTCGGCGCC GCCGGGCCCA CCGACGCCGT GATCGTGGCC
GGAAAGGGGC ACGAGAGCGG GCAGGACGTC GCCGGTGTCG TCACGCCGTT CGACGACCGC
GAGGTGCTGC GCTCCGCGCT GCTGGCGGCG GCGAGCCGAT GA
 
Protein sequence
MTTPALRPAA PSPRSLTALP DVLRSITVAP GPSGGAALAG VVLTGVTHDS RAVLPGDLYA 
ALPGAHVHGA DFAAGAVAAG AVAVLTDPVG AAALSGRLAV PVLVTDDPRR DLAPAAAWIY
GDPSSALTLL GVTGTNGKTT TAFLLDAGLR AAGYTTGLLG TVQTRVAGTV VPSVRTTPES
GDLQALLATM VERGVGAASM EVSSHALAQH RVDALRFAGA AFTNLSQDHL DFHPTMEDYF
AAKAALFEPA RSAAAVVCVD DDWGRRLAAL RPDARTYSVT GRPADWWPED VVAGPGGSVF
RALGPDGAKA DLSLALPGRF NVANALGALA LLAAVGVPLE AAAEGISSLP GVPGRMERID
AGQPFLALVD YAHTPGAVET LLATVRPIVT GRVVVVLGCG GDRDRAKRPL MGAAAARLAD
LAVFTSDNPR SEDPARILEQ MLAGARGAPG AGEIVVEPDR ASAIALAVGA AGPTDAVIVA
GKGHESGQDV AGVVTPFDDR EVLRSALLAA ASR