Gene Acel_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1008 
Symbol 
ID4485012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1113431 
End bp1114960 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content72% 
IMG OID639729783 
ProductUDP-N-acetylmuramoylalanine--D-glutamate ligase 
Protein accessionYP_872767 
Protein GI117928216 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.330059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0128737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGG TGAGCGTCGT CCCCGAATGG TTGTCGGACG CAGGCCATGA CGCCCCCTGG 
TCGCAGCTCA CCGTCTGCGT CGCCGGAGTG GGGATCTCCG GCCGGGCCGC GGCCCGGGTG
CTGGCCGCCC TCGGTGCCCA GGTCATCGCC GTCGATGACC GGGACGGCGA ACCCGAGCGT
GCAGCGGCCG CTGAGTTGGC CAGGCTCGGC GTCACCGTCC GGTTGGGGGA CGGCGCGACG
TTCCCGGACG GCGTCCAGCT GATCGTGACC TCGCCGGGGT GGCGGCGGGA ATCCCCACTC
TTCGCCGCAG CTGCGGACCG GGGTGTGCCG GTCTGGGGGG AGCCTGAGCT CGCCTGGCGG
TTGCGTCGTC CCGGGGACGC CGAGTGGCTT GTGATCACGG GAACGGATGG CAAGACGACG
ACGACGCTCA TGCTGGAGTC GATCCTGCGG GCCGCCGGGC TGCGGACGAT AGCGGCAGGC
AACATCGACG TGCCGCTGGT CGAGGTCGTC AACTCCGGTT ACGACGTTCT TGCCGTGGAA
CTCGGCAGTT TTCAGCTGCA CTGGTCCCCA TCCGTCGCGC CGAAGGCTGC GGCCGTCCTG
AACGTTGCTC CTGATCACCT CGACTGGTGG GGCGGATCGT TCACCGATTA CGCCAACGCT
AAGGGCCGGG CGTTCGCCCA TCCGCGGACC TGCGCGATCG GCAACCTGGA CGACCCGCAG
ACGGTCCGGT TGCTGGCCCG CGCACCGGGG CGGCGCGTCG GATTCACCCT GCACTCACCC
CGGCCCGATC AGGTCGGAGT TCACGACGGC GTCCTCCTTG ATCGAGCGTT CGTTCCCGAC
CCGGCTCGCG ACGTCGTCGA GCTTGCGACG GTCACCGACA TTCCCGTTCC CGGCGCGCAC
AACGTTGCCA ATGCGCTCGC CGCCGCCGCG CTGGCCCGTT CGATCGGGGT TGAGCCGGCG
GCGATCGCGG CCGGCTTGCG CACGTTCACC CCGGCGGCGC ACCGGATCGC GACGGTCGCC
GAGGTGGACG GCGTCCGGTT CGTCGACGAC TCCAAGGCGA CCAGCCCGCA CGCTGCGGCG
GCCTCGCTCA CCTCGTTCGA CCGGATCGTG TGGATCGCCG GCGGCCTGGG CAAGGACGTC
GCCTTTGACG AGCTCGTCAG CCAGGTGGCC GATCGGCTGC GCGGCGTCGT TCTGCTCGGA
GCATGCCGGC ATGAGATCGC CGATGCTCTC CGGCGACACG CCCCGCAGGT GCCGGTCATC
GACGTCGGCG GGGCCGAGAC TGGGGACGTG CACGCCGTTC TCGATGCCGC CGTTGCGGCG
GCCGTCCGCT ACGCCGCGCC GGGCGACGTC GTCCTCCTGG CACCGGCCGC CGCGTCCTAC
GACATGTTCC GCGACTACCG GCATCGCGGC CAGGCGTTCG CGGATGCCGT CCGCCGCTAC
GCCGAGCGCC GTTCGGCGGC GGAGCGACGT GAGGCGGCTG CGCAGGTTGG ACCGTCCGGG
CCGGCGGAGA GCGCGGGCGG CTCCCGGTGA
 
Protein sequence
MARVSVVPEW LSDAGHDAPW SQLTVCVAGV GISGRAAARV LAALGAQVIA VDDRDGEPER 
AAAAELARLG VTVRLGDGAT FPDGVQLIVT SPGWRRESPL FAAAADRGVP VWGEPELAWR
LRRPGDAEWL VITGTDGKTT TTLMLESILR AAGLRTIAAG NIDVPLVEVV NSGYDVLAVE
LGSFQLHWSP SVAPKAAAVL NVAPDHLDWW GGSFTDYANA KGRAFAHPRT CAIGNLDDPQ
TVRLLARAPG RRVGFTLHSP RPDQVGVHDG VLLDRAFVPD PARDVVELAT VTDIPVPGAH
NVANALAAAA LARSIGVEPA AIAAGLRTFT PAAHRIATVA EVDGVRFVDD SKATSPHAAA
ASLTSFDRIV WIAGGLGKDV AFDELVSQVA DRLRGVVLLG ACRHEIADAL RRHAPQVPVI
DVGGAETGDV HAVLDAAVAA AVRYAAPGDV VLLAPAAASY DMFRDYRHRG QAFADAVRRY
AERRSAAERR EAAAQVGPSG PAESAGGSR