Gene CPR_2180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2180 
Symbol 
ID4205523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2405651 
End bp2406775 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content26% 
IMG OID642566730 
Productputative monogalactosyldiacylglycerol synthase 
Protein accessionYP_699480 
Protein GI110803050 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID[TIGR00661] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.351092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TTCTTATTTT AAGTACATCT ACTGGTTATG GTCATAATCA AGCAGCTAAC 
TCTTTAATGG AACTTATAAA GAATGATGAT ACAGAAATTC TTGTACACGA TTTTCTTAAG
GAGAACCGTT TCTTTGATAG ATCAATAGTA AATGGATACG ATTTATGTGC AAGTTCCCTA
GGAACTCTTT ATGGTCTACT TTACAAAATT AGTAATATAA AATTTATAAA TAATTTAGTT
AGTTTTTTAT TTTTGCCAGT TGCAAATAAA TTAGTTAAAT TCATTCATAG TTTTAATCCA
GATTTAATAA TTACAACCCA TCCTTTAGCC GTAAGTATTC TTTCTTATTT AAAGAAAAGA
CAAATTATAA AAGTACCAGT TATTTCTGTT GTTACTGATT TTAAATGTCA TTATACTTAT
GTTTCTAAAA TAATTGATCA TTACATAGTT GCTTGTGATT TTACAAAAGA AAATCTAGCC
TCAAAGGGAA TACCTAAAGA AAGAATATCT CCTTTTGGTA TACCTGTAAA GCAAGATTTT
TATAAAGAAG ATTACCATAA TTATATAGAG AATATTATCC AATCACCATT AAATATTCTT
TTAATGGGAG GCGGTATGGG GTTAGATAAT ATATCTAAAG TATTAAAAAC CCTTATCAAA
AATGATAACC CTTTAAATTT AACTATTGTT TGTGGAAACA ATGCTGAGTT AAAGAAAGAA
TTGTGTAAAG AATATGGGCA TATAACGGGT AATAAAAATT TAAATATTTT AGGATACACA
ACTGAAATAC CTAAGATAAT GAAAAGTTCG GATTTAATTA TTACAAAACC TGGTGGACTA
ACTACTACAG AATCACTTTT AAGTCATTTA CCAATGATAA TTCCATTTAT AATTCCAGGT
CAAGAAAGTG AAAATAGAGA ATTTTTATCT AAAAGTAATT GTGCAATTAC TATCAACCAC
TTGGAAGAAT TAAATAAGGT AATTAATGAT TTAAATAAAG ATAATAATAA ATTAATTAAT
ATGAGACAAA GTATTTTAGA TGTATTAAGT TCTTATTCTC CAGAAGAAAC AATTAAACTT
TGCACTAAGA TGCTAAATGA TTCTTATAAC AAAAGAAGAT ATTAA
 
Protein sequence
MKKVLILSTS TGYGHNQAAN SLMELIKNDD TEILVHDFLK ENRFFDRSIV NGYDLCASSL 
GTLYGLLYKI SNIKFINNLV SFLFLPVANK LVKFIHSFNP DLIITTHPLA VSILSYLKKR
QIIKVPVISV VTDFKCHYTY VSKIIDHYIV ACDFTKENLA SKGIPKERIS PFGIPVKQDF
YKEDYHNYIE NIIQSPLNIL LMGGGMGLDN ISKVLKTLIK NDNPLNLTIV CGNNAELKKE
LCKEYGHITG NKNLNILGYT TEIPKIMKSS DLIITKPGGL TTTESLLSHL PMIIPFIIPG
QESENREFLS KSNCAITINH LEELNKVIND LNKDNNKLIN MRQSILDVLS SYSPEETIKL
CTKMLNDSYN KRRY