Gene Francci3_1417 details


Gene Information

Locus tag: Francci3_1417
Symbol:
ID: 3903398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1707364
End bp: 1708872
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 72%
IMG OID: 637878754
Product: UDP-N-acetylmuramate--L-alanine ligase
Protein accession: YP_480523
Protein GI: 86740123
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0773] UDP-N-acetylmuramate-alanine ligase
TIGRFAM ID: [TIGR01082] UDP-N-acetylmuramate--alanine ligase
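
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1707364
end_bp = 1708872

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the sketch reproduces the 1509 bp gene length and the 502 aa protein length listed in the record.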


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0845142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGTC AGCGCACCCA GCACGTCCAT TTCCTCGGCA TCGGAGGCAG CGGCCTGTCT 
CCGCTGGCCC AGATCCATCT CGCGGGCGGC GGGACGGTGA GCGGCAGTGA CTCCGAAGAC
TCGCCCCGGG TGGCGACCCT ACGCGCACGG GGGGTACCGA TCCGGATCGG GGCCACGCCC
GGTCCGGCCG CCTTCGCCGC CGAGCTCGCC GGTGCGGATG TCGTGGTCGC GTCCAGTGCG
CTGCCCGACG ACCATCCGGA GATCGTCGCC GCCCGCGCGC TCGGTCTGCC CGTGCGCCGC
CGTTCGGAAT GGCTACCGGA GCTCACCGCC GGCTACCGGC TGGTCGCCGT CGCCGGTTCG
CACGGCAAGT CGACCACCTC GGCGATGCTC ACCCTGGTCC TGCGCGCCGC CGGGCTGGAT
CCCACGGCGG TCATCGGCGC GGAGGTGTCA CAGCTGGGAG GGAACGCGCT GGCCGGCTCC
GGCGACGTCT TCGTCCTGGA GTCCGACGAG TACGGCGGCG CCTTCGCGGG CCTCGACCCG
AGCATCGCCG TCATCACCAA CGTCGAGTGG GAGCACCCTG ACGTCTTCCC CGACGAGGCA
TCGGTCCGGA CGGCCTTCGC CGCGTTCGCC CGACGGGTCC GGCCGGGTGG ACGACTGGTC
GTCTGCGGAG ATCATCCCGG GGTGGTCGCC GTCCTCACCG AGCTGGGTCG CCAGCGGCCC
GGCAATGACG TCGCGGTGAA TGACGTCGCG GTGATCGACT ACGGCTTCGG TGCCGAACGC
CACTGGCGCG CCGTCGACGT CGTCACCACC GCGGGGGACG ACATGACCAG GGCGACCGTC
CTGCGGGCCG GTCAGGAGAT CGGCGCGCTG ACCCTGACGG TACCCGGCCG GCACTCGGTG
CTGAACGCGC TCGCGGTCCT CGCCACCGCC ACCGAGCTGG GAGTCGCTCC CGCCCAGACC
CTCACCACCC TGACGACCTT CACCGGCGCG GCCCGACGGT TCGAGTTCGT GGGCTTCTGG
AACGGCCCCG CCGACAGCAC CGGCGTCGGA CCCGCCGGAG GTCCGGGCAG CCTGGAGGTG
ATCGACGACT ACGCGCATCA TCCGACGGAG GTCCGTCTGA CGCTCGCCGC GGCCCGTTCA
CGGGCTCGGG GACGGCAGAT CTGGACGGTA CTCCAGCCAC ATACGTTCAG TCGGTTCGCC
GCGCTACTGG ACGACTTCGC GGCCGCGTTC AGCGACGCCG ACCGGGTGTA CGTCACCGAC
ATCTATGCCG CCCGCGAGAC CGACGACCTC GGTCTGCATG CCGTCGACCT GGTCAAGCGG
GTGAGCGAGC CGGCGGCAAC ATACTACGTG TCCTGGCCCG AACTCGTCGA ACGACTCGCC
ACGGACGTCC GCGTCACCCT GAGCGACGAG GCGTCGCGGG GGATCCTGCT GCTCACGCTC
GGAGCGGGAA CGATCACCAC GGTCGGTCCC CGGCTGCTGG CCGCCCTGGG CTTCTCGGCG
GCCGGTTGA
 
Protein sequence
MTGQRTQHVH FLGIGGSGLS PLAQIHLAGG GTVSGSDSED SPRVATLRAR GVPIRIGATP 
GPAAFAAELA GADVVVASSA LPDDHPEIVA ARALGLPVRR RSEWLPELTA GYRLVAVAGS
HGKSTTSAML TLVLRAAGLD PTAVIGAEVS QLGGNALAGS GDVFVLESDE YGGAFAGLDP
SIAVITNVEW EHPDVFPDEA SVRTAFAAFA RRVRPGGRLV VCGDHPGVVA VLTELGRQRP
GNDVAVNDVA VIDYGFGAER HWRAVDVVTT AGDDMTRATV LRAGQEIGAL TLTVPGRHSV
LNALAVLATA TELGVAPAQT LTTLTTFTGA ARRFEFVGFW NGPADSTGVG PAGGPGSLEV
IDDYAHHPTE VRLTLAAARS RARGRQIWTV LQPHTFSRFA ALLDDFAAAF SDADRVYVTD
IYAARETDDL GLHAVDLVKR VSEPAATYYV SWPELVERLA TDVRVTLSDE ASRGILLLTL
GAGTITTVGP RLLAALGFSA AG
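
Both sequences are printed in space-separated blocks of ten letters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and chosen only for illustration. Only the first line of the gene sequence is used as sample input. Note that under translation table 11 the leading GTG is an accepted start codon, which is why the protein sequence begins with M.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above, used only as sample input.
first_line = "GTGACCGGTC AGCGCACCCA GCACGTCCAT TTCCTCGGCA TCGGAGGCAG CGGCCTGTCT"
dna = clean_sequence(first_line)

print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

To check the full gene, paste all sequence lines into a single string and pass it through the same two functions.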