Gene Acid345_3631 details

Gene Information

Locus tag: Acid345_3631
Symbol:
ID: 4070151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4298946
End bp: 4300307
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 60%
IMG OID: 637985654
Product: UDP-N-acetylmuramoylalanine--D-glutamate ligase
Protein accession: YP_592706
Protein GI: 94970658
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase
TIGRFAM ID: [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase
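
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which is consistent with the listed length, but is not stated in the record), and the helper names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 4298946
end_bp = 4300307


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length):
    """Residues encoded by a CDS of nt_length bases, excluding the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Both values agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields.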


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.916376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTTC GAGGGAAGCG CGTACTGGTT GTCGGGTTAG GGAAGTCGGG CATTGCGTCG 
GCAACGTTCT TGCAGGCACA AGGCGCGAAG GTGACTGTTT CGGATTCGAA GTCGGAAGCG
CAGCTTCGCC AGGAAATTCC GCTCCTTCTC GACAAAGGCA TCACGGTCGA AACCGGCCAT
CACGGCGAGC GTACTTTCCG CGACCAGGAC TTGATCGTCA TCAGCCCCGG TGTTCCGTTC
GATCAGCCGC AACTCGAGCA GGCCCGCAAG CAGGGAATTC CCGTCATCGG CGAGATCGAA
CTCGCGGCCC AGTTTGTTCC CGGTCACGTC ATAGCCATCA CCGGCTCCAA CGGCAAGACG
ACCACGACCT CTCTCTGCGG CGACATTCTG CAATCCGGCG GCAAGAAGAC GCTTGTCGGC
GGCAACATCG GCACGCCGGC CATAAGCTTT GCCCAACTTG CCAATGACGA CACTTGGAGC
GTCCTCGAGA TTTCCAGCTT CCAGTTGGAG ACCATCGAGC GCTTCCGCCC GGAAATCGCG
GCGATCCTCA ACATCACGCC GGATCACCTC GATCGCCACG GGACCTTCGA GAAGTACGCC
GCCGCCAAGG AACGCATTTT CGAAAATCAG CGCGAGCACG ACTTCGCCAT CCTCAACGCC
GACAACGAAC CGTGCGTTGA GATCGCCAAG CGCGTGAAGT CGCAGGTGCT CTGGTTCTCG
CGGCAGCACG AAGTGAAGCA CGGCACCTTC GTCCGCGAAG ACAAGATCTA CTTCCGCGAT
CCCAAAGGCG AGCGCGAGAT CATGCCGGTT GCCGACATGC TGCTCAAAGG CGCGCACAAC
GTCGAGAACG TTCTCGCCGC AGTTTGTGTA GGCGTCGCCG CCAGTGTTGC GCCCGAGCAG
ATTCGCAAAG CTGTCTCGCA GTTCAAAGCC GTCGAGCATC GCCTCGAATA CACCGCCACC
GTCAAGGGCG TGGACTACTA CAACGACTCC AAAGCCACCA ACGTGGATGC GACCATCAAG
GCCCTCGAGT CCTTCAGCAA GGGTGTGCAC CTCATCCTCG GCGGCAAAGA CAAAGGCAGC
CCCTACACCG TGCTCAACGA TCTCCTCCAC GAGCGCGCCA AGACCGTGTA CACCATCGGC
GCAGCGGCAG CGAAGATCGA AGCCGAAGTA AAAGGTGTGG AAGTCGTCCA CGCCGAGACC
CTCGAAAACG CCGTGAAGCT TGCGTCGCAA AAAGCGGTGA AGGGCGACGT TGTCCTGCTC
GCGCCTGCCT GCGCCAGCTT CGACCAGTTC CAGAGCTACG AGCATCGCGG ACGCATCTTC
AAAGAGCTCG TCCGCAAGAT GGCAGAGCAG GAGAAGAAGT AG
 
Protein sequence
MDVRGKRVLV VGLGKSGIAS ATFLQAQGAK VTVSDSKSEA QLRQEIPLLL DKGITVETGH 
HGERTFRDQD LIVISPGVPF DQPQLEQARK QGIPVIGEIE LAAQFVPGHV IAITGSNGKT
TTTSLCGDIL QSGGKKTLVG GNIGTPAISF AQLANDDTWS VLEISSFQLE TIERFRPEIA
AILNITPDHL DRHGTFEKYA AAKERIFENQ REHDFAILNA DNEPCVEIAK RVKSQVLWFS
RQHEVKHGTF VREDKIYFRD PKGEREIMPV ADMLLKGAHN VENVLAAVCV GVAASVAPEQ
IRKAVSQFKA VEHRLEYTAT VKGVDYYNDS KATNVDATIK ALESFSKGVH LILGGKDKGS
PYTVLNDLLH ERAKTVYTIG AAAAKIEAEV KGVEVVHAET LENAVKLASQ KAVKGDVVLL
APACASFDQF QSYEHRGRIF KELVRKMAEQ EKK
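
The relationship between the two sequence blocks can be sketched in a few lines: strip the grouping whitespace, translate codon by codon, and compute the GC fraction. This is a minimal illustration, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons; this gene starts with ATG, so that difference does not matter here). The function names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGACGTTC GAGGGAAGCG CGTACTGGTT"
print(translate(first_blocks))  # MDVRGKRVLV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.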