Gene Francci3_1411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1411 
Symbol 
ID3903392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1699854 
End bp1701257 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content75% 
IMG OID637878748 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_480517 
Protein GI86740117 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0779363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0360217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGC TGACCCTGGC CGAGGTCGCG TCGGCGACCG GCGGCCGGCT CGACGCGGTG 
CCCGACCCGG GCGTCACGGT GCGTTCGGTC GTCGTCGACT CCCGCGAGGT CCGCGACGGC
GCCTTGTTCG TCGCCCTGGC TGGCAGCCGG GTCGACGGGC ACGACTTCGC CGCCGCCGCC
GTCGCCGCCG GCGCGTCCGC CGTGCTCGCC GCCCGGTCGG TGGGCGAGCC GGCCGTCATC
GTTCCCGATC CGCCTTCGGC GCTCGCCGCG CTCGCCGCGT ACGTCCGTGA TCTGGCCGCC
GCCACCGTGG TCGCCGTGAC CGGGTCGGCG GGCAAGACGA CGACCAAGGA CCTGCTCGCC
GACGTGCTGG GCGGGCTCGC CCCGACGGTG GCCGCGCCCG GCTCCTTCAA CAACGAGATC
GGACTGCCGC TGACGCTGCT GCGCACCGAG CCGGACACGG CGTTCGTCGT CCTGGAGATG
GGTGCCCGCG GTCCGGGGCA CATCGCCACC CTCTGCGCGG TAGCCCGCCC GGCTGTGGGG
GTGGTCCTCA ACGTCGGCAG TGCCCACCTG GGCGAGTACG CCGACGGCCG GCTGGGGATC
GCCGCGGCTA AGGGCGAGCT TGCCGAGGCC GCGAGCGAGG CCGTCGTGCT CAACGCCGAT
GACCCGCTGG TCGCGGCGAT GGCGGTTCGG ACGACGGCCG AAGTGATCAC TTTCGGGGAG
GGTGGACGGG CCGACGTGCG GGCCGGTGCC GTCGATGTCG ACCGGCTGGG CCGCGCCTCG
TTCGACCTGC TGGCGCACGG CGAGCACCAT CGGGTGACCC TCGGGCTCGT CGGCGCGCAC
CAGGTGCCGA ACGCGTTGGC CGCGGCGGCC GTCGCGATCC GGCTGGGGCT GTCCCCGGAC
CGGGTCGCCG CGGCGCTGTC CGCCGCCCGC CCGCGCAGCC GGTGGCGGAT GGAGGTGACC
TCCACCGCGG CCGGGGTGGT GGTCGTCAAC GACGCCTACA ACGCGAACCC GGAGTCGATG
CGGGCGGCGC TGAAGGCGTT GGTGGACATG CGGGGGAAGG GCCGGGCGTT CGCGGTGCTC
GGTCCGATGG GGGAACTCGG TGACGCCGCC GCCGCCGAGC ACGACGTGCT CGGCCGGTTC
GCGGTCCGCC TCGGGGTCGA TCGACTGATC GCGGTGGGTC CGGCGGCCCG CCATATCCAC
CTGGGCGCCT CGCTGGAAGG CTCCTGGGAC GGGGAGTCGG TGGAGGTGAC CGACGCCGAG
GAGGCGGTGG CCCTGGTGGC CGCGCAGGCC GGACCGGACG ACGTGGTACT GGTCAAGGCC
AGCCGGTCCT TCGGTCTGGA GCGCGTCGCC GAGGCGTTGG TGACCAGATT CGGCGTCCTC
GGCGCCGGGA TCGAGGGGAC ATGA
 
Protein sequence
MIPLTLAEVA SATGGRLDAV PDPGVTVRSV VVDSREVRDG ALFVALAGSR VDGHDFAAAA 
VAAGASAVLA ARSVGEPAVI VPDPPSALAA LAAYVRDLAA ATVVAVTGSA GKTTTKDLLA
DVLGGLAPTV AAPGSFNNEI GLPLTLLRTE PDTAFVVLEM GARGPGHIAT LCAVARPAVG
VVLNVGSAHL GEYADGRLGI AAAKGELAEA ASEAVVLNAD DPLVAAMAVR TTAEVITFGE
GGRADVRAGA VDVDRLGRAS FDLLAHGEHH RVTLGLVGAH QVPNALAAAA VAIRLGLSPD
RVAAALSAAR PRSRWRMEVT STAAGVVVVN DAYNANPESM RAALKALVDM RGKGRAFAVL
GPMGELGDAA AAEHDVLGRF AVRLGVDRLI AVGPAARHIH LGASLEGSWD GESVEVTDAE
EAVALVAAQA GPDDVVLVKA SRSFGLERVA EALVTRFGVL GAGIEGT