Gene Cagg_3054 details


Gene Information

Locus tag: Cagg_3054
Symbol:
ID: 7269471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3714254
End bp: 3715786
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 59%
IMG OID: 643567874
Product: UDP-N-acetylmuramyl-tripeptide synthetase
Protein accession: YP_002464348
Protein GI: 219849915
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0769] UDP-N-acetylmuramyl tripeptide synthase
TIGRFAM ID: [TIGR01085] UDP-N-acetylmuramyl-tripeptide synthetase
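
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The Python sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3714254, 3715786)
print(length, protein_length(length))  # 1533 510
```

Under these assumptions the result agrees with the reported 1533 bp and 510 aa.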


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000380955
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGAAAA CACTCGCCGA TTTGTTGTCT GGGGTATCCG TGTACCATCT GATCGGTGAC 
CCTGCTACGC CGATCAGTGC TCTTGTCTAC GACTCGCGCC GGGTGACGCC GGGGAGTCTG
TTTGTCGCGA TTCGTGGGCA GCACAGCGAT GGCCATAACT TTATCCCACA AGCGATTGCT
GCCGGAGCGG CAGCAGTGGT CGTCGACCAG CGGTACTGGC ATGGTGCGGT GTCGGCTGAA
GTGCCGGTCG TGGTTGTTGC CGACAGTCGG GTGGCGCTGG CGCCTTTGGC TGCCGCCTTT
TACGAGTATC CGGGGCAACA GTTGACGACC ATCGGGATTA CCGGCACCAA AGGGAAGAGT
ACGACCACCG ACCTGACGGC TCAGCTCTTG GCTGCTGCTG GGCGTACTGT AGGAATGATC
AGTACGGTTG ATTTTCAGAT CGGTACGCGG CGCTGGCCGA ATGACACCCG CCAAAGTACA
CCGGAAGCGC CAGAAGTACA GGCCCTCTTG CGCGAAATGG TTGTCGCCGG GTGCGATACA
GCCGTGATTG AGGCTACTTC TCACGCCCTC TCGCCGCGTT GGGGGCGGTT GGTGGGTTGT
GCCTTTGCGG TGGCGGTGAT GCTCAACATC GGTCACGAAC ATCTCGATTT TCACGGTACC
TTTGAGCAAT ACCGGGCCGA CAAAGCCCAA CTCTTTGCCC TGCTTGCCGA ACGTCATGGC
CCGACGTGGG CTATTGTGAA CGCCGATGAT CCACATCACG GCACGTTCCT GGCTGCTGTA
CCGTCGCAAA CGATGCGATT ACGCTATGCC TTGCACGCTC CCGCCGACGT GCAAGGTCAG
ATTGTCCACA GTGGGCCGGC AAGTAGCCAC ATGCGGATAC ATTCACCGTG GGGCGAAATT
GAAGTGAACG TGCCGCTTCC CGGACGTTTC AATGCCAGTA ACGTACTGGC CGCACTCACC
GTTGCCCTTA CGCAGGGTGT GCCGCTCGAA CGTGCTGCTG CCGCTGTCGC TCATGTCCGT
GCCCCACGCG GGCGGATGGT GTCGATCAAT GTCGGTCAGC CGTTTACCGT AATCGTCGAT
TACGCGCATA ATCCCGATTC GTTTGAGCAA ATCTTCACCA TGCTTCGCCC ACAGGTGACA
GGTCGAATAA TCGCTGTGTT TGGAAGCGCC GGCGAGCGCG ATGTGAGCAA ACGCGCCATT
CAAGGCGAGA TTGCCGGGCG TATGTGTGAT TTGCTCGTGT TGACCGATGA AGACCCGCGT
GGCGAAGATC GTGAGGCGAT CATTGCCCAG ATCGCTGCCG GCGCCGAACG AGCCGGTAAA
CGACCCGGGA GTGGTTACCT TTGTATTCCC AACCGGGCAC AAGCGATAAG GACGGCCATC
GCTGCCGCTC GGCCCGGTGA TATGGTGTTG CTGTTAGGCA AAGGTCACGA AGGTAGTATC
ATCTACGCCG ATTATACGCT GCCGTGGGAT GAAGAAGGTG AAGCGCGACG GGCGCTGGCC
GAATTAGGCT ATCATGCTAA AGAGCAAGCA TGA
 
Protein sequence
MPKTLADLLS GVSVYHLIGD PATPISALVY DSRRVTPGSL FVAIRGQHSD GHNFIPQAIA 
AGAAAVVVDQ RYWHGAVSAE VPVVVVADSR VALAPLAAAF YEYPGQQLTT IGITGTKGKS
TTTDLTAQLL AAAGRTVGMI STVDFQIGTR RWPNDTRQST PEAPEVQALL REMVVAGCDT
AVIEATSHAL SPRWGRLVGC AFAVAVMLNI GHEHLDFHGT FEQYRADKAQ LFALLAERHG
PTWAIVNADD PHHGTFLAAV PSQTMRLRYA LHAPADVQGQ IVHSGPASSH MRIHSPWGEI
EVNVPLPGRF NASNVLAALT VALTQGVPLE RAAAAVAHVR APRGRMVSIN VGQPFTVIVD
YAHNPDSFEQ IFTMLRPQVT GRIIAVFGSA GERDVSKRAI QGEIAGRMCD LLVLTDEDPR
GEDREAIIAQ IAAGAERAGK RPGSGYLCIP NRAQAIRTAI AAARPGDMVL LLGKGHEGSI
IYADYTLPWD EEGEARRALA ELGYHAKEQA
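
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before they can be used for length or composition checks. The following Python sketch shows one way to do that, using only the first line of the gene sequence as sample input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation here is a plain G+C count over sequence length, which may differ from how the database derived its GC content figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGCCGAAAA CACTCGCCGA TTTGTTGTCT GGGGTATCCG TGTACCATCT GATCGGTGAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Applied to the complete gene and protein sequences, the cleaned lengths should match the Gene Length (1533 bp) and Protein Length (510 aa) fields.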